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PREFACE 


The title of this volume is an abbreviation for the more 
properly descriptive one: “Topics in the theory of approxi- 
mation”. It is a brief essay in a field on which an encyclopedia 
might be written. On the personal side, it is an account 
of certain aspects and ramifications of a problem to which I 
was introduced at an early stage, and which has given direction 
to my reading and study ever since. 

One day about twenty years ago I was admitted to the 
study of Professor Landau, seeking advice as to a subject 
for a thesis. After some preliminary inquiries as to my 
experience and preferences, he handed me a long sheet of 
paper, and directed me to take notes as he enumerated some 
dozen or fifteen topics in various fields of analysis and number 
theory, with a few words of explanation of each. He told 
me to think about them for a few days, and to select one 
of them, or any other problem of my own choosing, with the 
single reservation that I should not prove Fermat’s theorem, 
an injunction which I have observed faithfully. Guided partly 
by natural inclination, perhaps, and partly by recollection 
of a course on methods of approximation which I had taken 
with Professor Bécher a few years earlier, I committed myself 
to one of the topies which Landau had proposed, an investi- 
gation of the degree of approximation with which a given 
continuous function can be represented by a polynomial of 
given degree. When I reported my choice, he said meditative- 
ly, in words which I remember vividly in substance, if not 
perfectly as to idiom: “Das ist ein schönes Thema, ich beneide 
Sie um das Thema... Nein, ich beneide Sie nicht, aber es 
ist ein wunderschönes Thema!” It is in fact a problem which 
admits a surprising variety of interesting developments on 

il 
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its own account, and offers a natural avenue of approach 
to a number of fields of still broader importance. 

Although delayed in its completion by the conflict of other 
duties, the following exposition is substantially in the form 
in which it was projected at the time of the Colloquium 
lectures in 1925, and presented in abstract in the lectures 
themselves. One section, on the vector analysis of function 
space, originally designed for inclusion in the Colloquium, 
has meanwhile been published separately instead. The sections 
which had been written at full length in September, 1925 
— practically the whole of the first chapter, parts of the 
second, and most of the third — have been left unchanged, 
except in minor details. The elementary account of Legendre 
series in Chapter I, for example, was written before the appear- 
ance of the admirable article on the subject by M. H. Stone 
in vol. 27 of the Annals of Mathematics. A few other articles 
published since 1925 are mentioned in the text. 

For the most part, however, citations of the literature have 
been omitted. The preparation of a really adequate bibliog- 
raphy would have been a task of such magnitude as to 
delay the publication indefinitely. References to some of 
the most important papers of not too recent date are con- 
tained in my thesis (Göttingen, 1911) and in my report 
on The general theory of approximation by polynomials and 
trigonometric sums in vol. 27 of the Bulletin of the American 
Mathematical Society. Among publications in book form 
supplementing the material given here, mention should be 
made of Borel’s Leçons sur les fonctions de variables réelles 
et les développements en séries de polynomes, de la Vallée 
Poussin’s Leçons sur l'approximation des fonctions d'une 
variable réelle, and S. Bernstein’s Leçons sur les propriétés 
extrémales et la meilleure approximation des fonctions analytiques 
dune variable réelle, all appearing in the Borel series. As 
to the content of these lectures themselves, there are many 
points where it would be difficult now to recall the original 
sources either of specific results and proofs or of suggestions 
as to method. To the extent that the work is my own, some 
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parts have been published previously, in my thesis, in various 
articles in the Transactions of the American Mathematical 
Society, and elsewhere; other parts are now offered in print 
for the first time. Numerous detailed acknowledgments, not 
repeated here, have been made in the pages of the earlier 
publications. In connection with Chapter IV, reference should 
still be made to the work of Faber on trigonometric inter- 
polation in his memoir Über stetige Funktionen (zweite Ab- 
handlung) in vol. 69 of the Mathematische Annalen. My 
acquaintance with the statistical formulas discussed in Chapter V, 
which might have come from any of a variety of sources, was 
in fact mostly obtained from Yule’s Introduction to the The- 
ory of Statistics. The lemma on which the method of Chapter TH 
depends is derived from the most important single memoir in 
the literature on degree of approximation, S. Bernstein’s epoch- 
making prize essay of 1912, with which the present work also 
has other points of contact. And in conclusion it should be 
said that my study of the problem has been dominated from 
the beginning not only by the influence of my own teachers, 
but also by the writings of Lebesgue and de la Vallée Poussin. 


October 1, 1929 
DUNHAM JACKSON 
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CHAPTER I 


CONTINUOUS FUNCTIONS 


Introduction 


Weierstrass first enunciated the theorem that an arbitrary 
continuous function can be approximately represented by a 
polynomial with any assigned degree of accuracy. The 
theorem may be stated with precision in the following form: 

If f(x) is a given function, continuous for a < x < b, and 
if e is a given positive quantity, it is always possible to define 
a polynomial P(x) such that 


\f(2)—P(@)| <e 
for a<2z<b. 
To Weierstrass is due also the corresponding theorem on 
approximation by means of trigonometric sums: 
If f(x) is a given function of period 21, continuous for 
all real values of x, and if e is a given positive quantity, it 
is always possible to define a trigonometric sum T (x) such that 


f@—T@|<e 


Jor all real values of x. 
By a polynomial is meant an expression of the form 


ot a1 e+ aea27+ ++. tar" 


This expression will be said to represent a polynomial of 
the nth degree, not only when a, is different from zero, but, 
in distinction from the usage which prevails in some parts 
of algebra, also when a, = 0. That is to say, the words 
“polynomial of the nth degree” will be used in place of the 
longer expression “polynomial of the mth degree at most”. 


Even the case of identical vanishing is not excluded. A trig- 
1 1 
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- 


onometric sum, or more specifically a trigonometric sum of 
the nth order, is an expression of the form 


+ a COS x+ az cos 2r-+ -.» +a, COS ne 
+b, sin x+ b sin 27+ --. +b), sin nz. 


The definition is inclusive once more; the simultaneous vanish- 
ing of an and bn is not ruled out. 

These two types of approximating function show a persistent 
and fundamental similarity in their behavior, on which 
differences of more or less significance are from time to time 
superimposed. Simplicity of statement and proof will favor 
sometimes one and sometimes the other. 

It is readily seen that the number of terms required to 
yield a specified degree of approximation, or, under the 
converse aspect, the degree of approximation attainable with 
a specified number of terms, will be related to the properties 
of continuity of f(x). It is the purpose of the next pages 
to trace out this relationship in some detail. 


1. Approximation by trigonometric sums 


For a considerable body of results, the following theorem 
may be regarded as fundamental: 
THEOREM I, Jf f(x) is a function of period 2a, such that 


|S (v2) —J(%)| < A| 2e — zi 


Jor all real values of x, and xs, 4 being a constant, there will 
exist for every positive integral value of n a trigonometric 
sum T(x), of the nth order, such that, for all real values of x, 


O- yay) < E, 


where K is an absolute constant, depending neither on x, nor 
on n, nor on 4, nor on any further specification with regard 
to the function f(x). 

In the proof of the theorem, use will be made of the 
following 
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Lemma. Jf m is a positive integer, the expression 


sin‘ (mix/2) 
sin* (a/2) 


is a trigonometric sum in x, of order 2m—2. 
Because of the identity 


COS px cos qx = : [cos (p+q)x + cos (p — q) z] 


and the others of similar type, it is seen at once that the 
product of two trigonometric sums, of orders n, and ne 
respectively, is a trigonometric sum of order n, -+na. It is 
sufficient for the purpose in hand, therefore, to recall any 
one of the numerous proofs of the well-known fact that 


sin*(maz/2) = 1— cosmz 
sin? (a@/2) 1— cosx 


is a trigonometric sum of order m—1; its square will then 
be a sum of order 2m—2. The fact that 1— cosmz is 
equal to the product of 1— cosx by a trigonometric sum of 
order m—1 appears, for example, from the identities 

m—1 


1—cosmaz = > [cospx—cos(p+1)a], 


po 

cos px — cos(p+1)x 

= (1— cosx) — $ [cos(q—l)a—2cosqv 

q=1 
-cos (q + 1)z], 

cos(q —1)x — 2 cosqa + cos (q +1) 

= [cos(q— 1)x + cos (q +1) a2] — 2 cosqax 

= 2 cos qx cosx — 2 cosqar 

= — 2 cosqz (1 — cosz). 


To proceed with the proof of the theorem, let 


sin mu 
msinu 


4 71/2 
ly Dae) = im J „Jf @&+ 2u) Fmlu)du, 


1* 


Fatu) = [ 
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where m is any positive integer, and /» is defined by the 


equation 
1 vel : 
E R ap imu) du. 
By means of the substitution #-+-2u = v, the expression 
for Im(a) is transformed into 


Lil" yom. E En 


Both factors in the last integrand have the period 2% with 
regard to v, so that the value of the integral is unchanged 
if the interval of integration is replaced by any other interval 
of length 27. In particular, 


ae 5 hin f SO) Fm [e—a] dv. 


The expression Fm [4(v—a)], by the Lemma above, is a 
trigonometric sum of order 2m—2 in (v—z), and may be 
regarded as a trigonometric sum of the same order in 7, 
with coefficients which are trigonometric functions of v. The 
whole integrand is a trigonometric sum in æ with coefficients 
which are continuous functions of v, and J,(«#) therefore is 
a trigonometric sum of order 2m — 2 in x, with constant co- 
efficients. The proof that this sum is an approximate rep- 
resentation of f(x), when m is large, will be based on the 
original representation of Jm(z). 

Let the equation defining -m be multiplied by Jm/(x). 
Since f(x) is a constant as far as w is concerned, it may be 
placed under the sign of integration, so that 


I(x) p= S ea S (x) Fm (u) du. 


Consequently 
71, 


Im (x) —f (x) = hm pa [f(@+2u)—f(@)] Fal) du. 
By the hypothesis imposed on f(x), 
If (w@+2u)—f(@)| < 24 |u]. 
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Hence 
; */2 
| Im (a) —f(ax)| < 2hhim | | u| Pnr(w du, 


or, since Fm (u) and |u| Fm (u) are even functions of u, 


j2 
P F u Em (u) du 
| Im (2) —f (x)| < 44hm ui u Fin (u) du = 225 i - 


Eml) ey 


To anticipate the conclusion of the proof, let 


72 sin* t sin‘ ¢ 
Ci =f = tt, Ca =f E dt. 


These quantities are merely numerical constants. Is is clear 
that each integrand approaches a limit for ¢ = 0, and that 
the improper integral defining cə is convergent. 

By the use of the fact that 0 < sin u < u for 0 < u < 7/2, 
and the substitution mu = t, it is recognized that 


sin mu 1 ("2 sintt 
1 Se Fy (wu) du> re Ja au = — mae dt 
mu m Jo t 
12 
z. 2 sin’? € 
= y at = +, 
—= m0 t m 


On the other hand, (sin )/u decreases monotonically as wu 
goes from 0 to 2/2, so that 


sinu _ sin(a/2) _ 2 1 wt 


w ` (7/2) nx’ sinu 2 t 


throughout the interior of this interval. Hence 


i, u Fm (u) du <| Eue uf sin mu See Pas 
3 mu 
m7) 2 sint t 
= mè de 2 ) Í S a dt 
223. [ah EE a \* Cy 
mats (2) J, aka (3) mè ` 


From these relations it follows that 


| Im (e)—f(@)\ < ai, 
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Now let n be an arbitrary integer, and let m be taken 
equal to $n-+1 or 3(n+1), according as n is even or odd. 
In either case, 2m—2<n<2m. Let the corresponding 
expression Jm (x) be denoted by 7, (x). Then 7, (x) is a 
trigonometric sum of the nth order (it will be remembered 
that this is understood to mean of the mth order at most. 
according to the more usual terminology), and, since 1/m—<2/n, 


g b 
Lm FOS ve = xe 

if A is taken equal to a*cy/(4c,). Thus the proof of the 
theorem is completed. 

So much has been conceded to simplicity of outline, in 
building up the above inequalities, that the final upper limits 
are quite unnecessarily large, giving little indication of the 
actual magnitude of the quantities that precede. It will 
add a little to the definiteness of the conclusion to point 
out that cı >(2/m)*, since (sin f)/t>2/a throughout the 
interior of the interval of integration, while 


ə] rey woe 4 1 w% 
sin* ¢ sin* ¢ dt 
y t.—.— dt + u< f: +f - - 
a 0 t ; Í i £ Jo ae: Py ee 


n E 
K =e < gg <0 


With more attention to detail, however, the estimate can be 
cut very much closer. The theorem is actually true with 
K = 3, instead of the value adopted above, or even with 
a somewhat smaller value of K. On the other hand, it can 
be shown that the statement is not generally true with a value 
of K smaller than 7/2. 

To pass on to a more general theorem, let f(x) be an 
arbitrary continuous function of period 27, and let œ(ð) be 
the maximum of |f(a.)—f(z,)| for |a,—a,| < ð. The 
function (0) has been called by de la Vallée Poussin the 
modulus of continuity of f(a). With the word maximum 
replaced by least upper bound, it can be defined for any 
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bounded function, whether continuous or not. The character- 
istic property of a uniformly continuous function is that 
limg— (0) = 0. 

Let ọ (x) be the continuous function of period 2a which 
takes on the same values as f(x) at the points 


Qa 4a 27 
——, —a + PEEPI TE 


' > T; 
n n n 


and is linear from each point of this set to the next. The 
graph of y(x) is a broken line, no segment of which has 
a slope greater than (2a/n)/(2a/n) in absolute value. 
In analytical language, ọ (æ) satisfies the hypothesis of 
Theorem I, with 

w(27/n) 

2a/n 


For every positive integral value of n, therefore, there is 
a trigonometric sum 7',(x), of the nth order, such that 


p(x) —Ta(a)| < 5 (23). 


2 — 


On the other hand, any specified value of a differs by less 

than 27/n from one of those for which f and p are by de- 

finition equal to each other; neither f(x) nor p(x) can differ 

by more than #(2/n) from the corresponding common value; 

and hence : 

i SONAR 

f(a)—9@)| < 2a (27) 

for all values of x. If the quantity K/(2a)+2 is denoted 

by K’, the last two inequalities may be combined to yield 
the following statement: 

THEOREM I. Jf f(x) is a continuous function of period 
2a, with modulus of continuity (0), there exists for every 
positive integral value of n a trigonometric sum T,(x), of 
the nth order, such that, for all real values of x, 

|F (@)—Tr(2)\ < K'o za 


where K' is an absolute constant. 
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While this theorem is applicable to any continuous function, 
it involves the modulus of continuity in the inequality which 
forms the essence of its conclusion. It can be shown that 
the assignment of an outer limit of error for an arbitrary 
continuous function, without some dependence on properties 
of the function beyond the mere fact of its continuity, is 
impossible. 

Since lim,—~. o(2a/n) = 0, it is to be noted that 
Theorem II includes one of the theorems of Weierstrass to 
which reference was made in the opening lines of the chapter. 

In preparation for the next developments, there is occasion 
to examine more closely the proof that was given above for 
Theorem I. It will be recalled that to an arbitrary positive 
integer n a second positive integer m was assigned, in terms 
of which a function Fm (u) was constructed; and a trigono- 
metric sum T(x), yielding an approximate representation 
of the given function f(x), was defined as equal to an ex- 
pression which could: be reduced to the form 


Fafi 10 ra [ko-a]an 


lim being. independent of x. A lemma stated essentially that 
Fm ($u) is a trigonometric sum in u, of order 2m — 2 < n. 
It is possible therefore to write F,,[}(v—)] in the form 


Ano + > [Ann cos k (v—2) + Bux sin k (v—a)]. 
~ k=1 


When the above expression for 7;,(@) is expanded as a trig- 
onometric sum in a, the constant term is 


1 MF 
4 hm Ano fis (v) dv, 


and is zero if the last integral vanishes, an observation which 
will presently be important, for the reason that the indefinite 
integral of a trigonometric sum without constant term is 
itself a trigonometric sum, while this is not the case if the 
sum to be integrated has a constant term different from zero. 
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It may be pointed out in this connection, though it is not 
essential to the main argument, that the coefficients Bnw are 
all zero. This can be inferred from an elementary theorem 
on trigonometric sums, since Fm (4u) is an even function of u, 
and is also directly apparent on inspection of the proof of 
the lemma. If a, >, are the Fourier coefficients of f(x): 


1 z ET EEE A i hA fo 
a = — 3 (v) coskvdv, br = 5 fro sin kv dv, 
and if $a m Ang is denoted by dax, it is seen that 


1 n 
Tala) = 5 dno ao +2 dni:(ay coska: + by sink). 
= jo 1 


As the d’s are independent of the function to be represented, 
the calculation of the successive expressions 7),(#) amounts 
to a method of summation of the Fourier series for f(x). 

To return from the digression of the last paragraph, let 
f(x) be a function of period 27, which has everywhere 
a continuous derivative f'(x). For a particular value of n, 
let h(x) be a trigonometric sum of the nth order, without 
constant term: 


hz) = = (ar coska + By sink), 
and let ¢, be a constant such that 
F (a) —th(z)| < en 
for all values of x. Let tn(x) be the trigonometric sum, 
without constant term, which has t(x) for its derivative: 
n 


tn(x) = bA S sin kz — Bu cos kr] : 
K k k 


and let rala) = f (x)— ta (£). Then rn(x) has the period 27, 
and, since rh (x)| < en, satisfies the conditions imposed on f (x) 
in the hypothesis of Theorem I, with 4 = en. Hence there 
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exists a trigonometric sum of the nth order, which may be 
denoted by 7, (x), such that 

K €n 

ei 

If Talx) = tr(x)+tn(x), then f(x2)— Tp (x) = ra(x2)— tr (x), 
and 


Th (x) — iin (x) 


Ken 


|J @)— Tala)| <- 


From the existence of an approximation for / (æ) it has been 
possible to draw an important inference with regard to the 
approximation of f(a). If f(x) is itself the derivative of 
a function of period 27, so that the integral of f(x) over 
an interval of length 27 is zero, it follows that 


“1 or on 
pe Tr (x) dx = [so da = i (x) dx = 0. 


whence, according to the second paragraph preceding, the 
sum Tr (x) given by the proof of Theorem I as an approximation 
for ra(x) will have no constant term. So the constant term 
in the present T(x), defined in terms of this 7,(x), will be 
zero likewise. 

The way has now been prepared for a demonstration of 

THEOREM II. Jf f(x) is a function of period 2a, having 
a pth derivative fP (x) such that 


|S? (w,)— f(a) | L ilara 


Jor all real values of x, and x, À being a constant, there will 
exist for every positive integral value of n a trigonometric sum 
Tn(x), of the nth order, such that, for all real values of x, 


Kri 
nri 


SF (@)— Tn (x) => 


where K is the absolute constant found in the proof of 
Theorem I. 

It is to be noticed that the argument is based on the ex- 
plicit construction of the approximating sum in Theorem I, 
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and it is not clear that a smaller value of K in that theorem, 
justified by some different method, would necessarily be 
applicable here. The value K = 3, previously mentioned 
in connection with Theorem I, results from the same explicit 

construction, and is available in Theorem III. 
By Theorem I itself, there exists a sum T(r) such that 
f° (a) —Tma (@)| < E. 


On the basis of the more recent discussion, as 


[7° @ae = fo @—fo(—2) =, 


it may be understood that Tw (x) has no constant term. 
Since fP (x) is the derivative of the periodic function 
Jf» (x), furthermore, an approximating sum Th2(x) may 
be constructed for f(x), as indicated above, with 
Fy = KAn. and 

KA 


SP (w)— The a) S oats 


It p>1, f(x) is itself the derivative of the periodic 
function f- (x), and the constant term in The (æ) is zero. 
By a sufficient number of repetitions of the process, the 
theorem is established. 

In Theorem LI, even if the integral of f (x) over an inter- 
val of length 27 is zero, the same thing is not necessarily 
true of the auxiliary function g(a), and it is not clear that 
the approximating sum in the conclusion of the theorem will 
lack the constant term. The difficulty is not a serious one, 


however. If 
[a @ae = s, 


let pı (x) = o (x)— c/(2 7). Then 


ie gı (x) dx = 0, 


and there is a trigonometric sum Tm (x), without constant 
term, such that 
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K 27 
lgi (7)— Tm (@)| < Fy O (z5). 


But on the hypothesis that the integral of fdæ over a period 
is zero, 


cl = | f" p @az| = | f fy @—se)) ac| 


A 

bo 

3 

bo 

£ 
— 

| 
pe A 


and 
yeaa < 20 (7), |e) @)| < 40 (7%). 


n 


/ 


So the conclusion of Theorem II applies to the approximate 
representation of f(x) by a trigonometric sum without constant 
term, when f(x) is the derivative of a periodic function, on 
the condition merely that K’— K/(2a)+2 be replaced by 
K" = K/(2n)+4. 
The same process of induction which was used to prove 
Theorem III then serves to establish 
THEOREM IV. Jf f(x) is a function of period 2% which 
has everywhere a continuous pth derivative, with modulus of 
continuity (0), there exists for every positive integral value 
of n a trigonometric sum Talx), of the nth order, such that, 
for all real values of x, 
n ç 
ro-nol < £35), 


nP n 


where K is the absolute constant given by the proof of Theorem 1, 
and K" = K/(2m)+4. 

A part of the content of this theorem may be restated 
in the following 

COROLLARY. If f(x) is a function of period 2a which has 
everywhere a continuous pth derivative, there exists for every 
positive integral value of n a trigonometric sum Ty(x), of the 
nth order, such that 

lim n? en = 0, 


n=% 


if & is the maximum of | f(x) — T(x) ). 
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2. Approximation by polynomials 


The development of the theory of approximation by trigono- 
metric sums will be interrupted at this stage, to make way 
for a presentation of the corresponding results with regard 
to polynomial approximation. The transition will be aided, 
however, by one more lemma on the trigonometric side. 

Lemma. If f(x) is an even function of period 27, and 
if there is a trigonometric sum Talx), of the nth order, such 
that | f (2) — Ta (x)| < e for all values of x, there exists a cosine 
sum Cy(x) of the same order (that is, a trigonometric sum 
without sine terms), such that, for all values of x, 


| f (a) — C(x) | ss E, 


When f(x) is even, the approximating sum given by the 
proof of Theorem I will automatically lack the sine terms, 
as an immediate consequence of the fact, already pointed 
out, that its definition is equivalent to a method of summation 
of the Fourier series; and this observation would be sufficient 
for the main argument; but it is of interest to note that the 
lemma subsists independently of any particular mode of con- 
struction of the original approximating function. 

For the proof, let 


Cala) = 5 [T+ (a). 


Then C,(x) consists merely of the cosine terms of 7',(z), 
without the sine terms. On the other hand, since f(x) is 
even, 

. 1 

sæ = z+ aN, 

and therefore 

|1 1 
IF@)—Cu(@)| = FA Tna yS Tala] 
€% 


Now let f(x) be a function defined for —1 <2 < 1, and 
subject to the condition 


Lf (a2) —f(a,)| < ilr — z | 


IA 
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throughout this interval. Let 
x = cos À, S(@) = f(cos 0) = (8). 


Then y(6) is an even function, defined for all real values 
of 6, and 

lp (42) — p (01)| = |f(cos 62) — f (cos 4) 
< A\cos 0e — cos6,| < 4\0.— 0l. 


By Theorem I, together with the lemma just proved, there 
exists a cosine sum Cn (0), of the nth order, such that 
K2 


lø (0) — Cn (0)| < ee 


But a cosine sum of the nth order in ð is a polynomial of 
the nth degree in cos@, which may be denoted by P, (a), 
and the conclusion is that a polynomial P, (x) exists such 


that 


Fel = betel eea 


n 
for = 1,77 L. 
If the interval (—1, 1) in the hypothesis is replaced by 
an arbitrary interval (a, b), a preliminary transformation of 
variable may be made according to the formulas 


y= 728 fe) = AW), 


whereby fı (y) is defined for —1 < y < 1, and 


Si (ys) — fi (n) 
= fe) fa) < iaa] = F400) lyn: 


The result just obtained may then be applied to the approx- 
imation of fı (y) by a polynomial in y, which is at the same 
time a polynomial in z. The general conclusion may be 
formulated as 

THEOREM V. Zf f(x) satisfies the condition 


|S (£) —f(a1)| < Alre —.z | 
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throughout the closed interval (a, b), of length l, there exists 
Sor every positive integral value of n a polynomial Py (x), of 


the nth degree, such that 


fe) — Pa| < =4 


for asa <b, with L=3K, where K is the constant of 
Theorem I. 

If the smallest possible value of K were found in Theorem I, 
it is not clear that 4K would then be the smallest admissible 
value of L, but it can be shown that the validity of Theorem V 
is not general for any Z smaller than }. 

More generally still, suppose that f(x) is an arbitrary con- 
tinuous function for a < x < b, and let (0) be its modulus 
of continuity in this interval. With b — a = 1, let (a) be 
the continuous function which takes on the same values as 
J (x) at the points 


l 21 
@, OTT OT a O E G= at b, 


and is linear from each point of this set to the next. The 
function g(a), having a broken line for its graph, satisfies 
the hypothesis of Theorem V, with 


œ (l/ n) 


KE lin 


while 
J (x)— g(x), < 2w (Un) 
throughout (a, b). There is a polynomial P, (æ) such that 
|o (2) — Pa (x)| < Lø (l/n); 
setting L42 = L’, one may state 
THeorReEM VI. Jf f(x) is a continuous function with mod- 
ulus of continuity œ(ò) in the closed interval (a, b), of length 
l, there exists for every positive integral value of n a poly- 
nomial Pa (x), of the nth degree, such that, for a S< x < b, 
| f @)— Pa(x)| < L’ w(l/n), 


where L' is an absolute constant. 
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This incidentally includes the theorem of Weierstrass on 
polynomial approximation, which was quoted at the beginning 
of the chapter. 

The proof of a theorem corresponding to Theorem III is 
simplified by the fact that the indefinite integral of a poly- 
nomial is always a polynomial, so that special considerations 
analogous to those relating to the constant term in the trig- 
onometric case are not needed. A new complication is 
introduced, on the other hand, by the circumstance that the 
degree of a polynomial is raised by integration, while the 
order of a trigonometric sum remains unchanged. 

Suppose that f(z) has a continuous derivative f’ (æ) for 
a <a < b, and that there is a polynomial py (x), of degree 
n—1, such that 

|S’ (@)—pn (@)| < én 


throughout the interval. Let 


‘iE pr (x) da = Par (2), SF (x)— Pn (2) = Ta (@). 


Since 7) (x)| < En, rn (x) satisfies the hypothesis of Theorem V, 
with 4 = ê. There is consequently a polynomial 7, (£), 
of the nth degree, such that 
| oa Ll n 
[Tn (£) — an (x) | = = =e 
If pn (£) + n (£) = Pp (x), this Pa (x) is a polynomial of the 
mth degree, and 
. Llen 
|J (x) — Pn (x)| = |1'n (£)— nn (x)| = Ea 

Let f(x) have a pth derivative / (x), satisfying the 

condition that 
|S P (t) —f P (a)| < A|ze— a | 


throughout (a,b). If m—p>O, there is a polynomial of 
degree »—p which differs from f(x) by not, more than 
L1i/(n—p) throughout the interval. There is then, by the 
preceding paragraph, a polynomial of degree n—p+l, 
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differing by not more than L7/°2/[(n—p)(n—p+1)] from 
Jf” (æ), and so on. Finally a polynomial P,(z) is obtained, 
of the nth degree, for which 
i= ino Le pty 
SF (x)— Pr()| = (n—p) (n—p+1) ror a, 

For the applications, the existence of a constant in the 
right-hand member of the last relation is far more important 
than any close estimate of its numerical value. It is most 
convenient, even at considerable unnecessary expense nume- 
rically, to be satisfied with the observation that 


N f 1 “L TE, = 
(n—p)(n—p+1])---n n—p 1— pi 
St rhe a 
+. Task . nori = p! ner 


for n =p-+1, and to state the result in the form of 
THEOREM VII. Jf f(x) has a pth derivative fP (x) satisfying 
the condition that 


lf (a2) —f (ay)| S Ala2— ay 


throughout the closed interval (a, b), of length l, there exists 
Jor every integral value of n> p a polynomial Pn (x), of the 
nth degree, such that for asx b, 


F(x) —Pa(z)| S npr =; 


where Ly = (p+1) LP /p!, and L is the constant of 
Theorem V. 

It is clear that even with 4 — 0 the hypothesis implies 
nothing whatever as to the possibility of approximating f(x) 
by a polynomial of degree lower than p, since f(a) itself 
may then be any polynomial of the pth degree. By suitable 
changes in formulation it would be possible, though of 
secondary interest, to admit the value n = 0 in Theorem V 
(or Theorem I), and the value n = p here. 

From Theorem VI, by reasoning similar to the above. one 
may deduce 
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THEOREM VIII. Jf f(a) has a continuous pth derivative with 
modulus of continuity (0) throughout the closed interval (a, b), 
of length l, there exists for every integral value of n>p 
a polynomial P, (x), of the nth degree, such that for a= x< h. 


Pn Pee l 
F(x)— Pa (x)| < = o( 25) 


where Lp = (p41) LP (L+2)/p!, and L is the constant 
of Theorem V. 

COROLLARY. Lf f(x) has a continuous pth derivative for 
aSxs< hb, there exists for every positive integral value of n 
a polynomial P, (x), of the nth degree, such that 


lim Pen = 0, 


n=% 


where €n is the maximum of fle)— Pax) in the interval 
(a, b). 

The exceptional status of the values of n< p has no 
significance for the corollary, which is concerned only with 
a limit for n = œ. 

3. Degree of convergence of Fourier series 

The preceding theorems can be made to serve as basis 
for a discussion of the convergence and rapidity of convergence 
of Fourier and Legendre series, as well as of other processes 
of approximation. This idea will be developed more fully in 
succeeding chapters; its first consequences will be presented here. 

With regard to the Fourier series for a given function f(z), 
it will be premised merely that it is a series of the form 


e d- Suu cos kæ + dy sin ke), 
in which the coefficients have the values 
Be ee Et eee S 
ye = 1 f s@ cosktat, by: = [7s sinke at, 


the formula for ay being applicable when k = 0, as well 
as when k is positive. The expression 
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Sa(x) = = +2 (a; cos ka + by sin kx) 


will be called the “partial sum of the series to terms of the 
nth order.” 
If the quantity 


a + cosu + cos2u -+ --- + cosnu 
is multiplied by 2sin}u, the product may be rewritten as 
sin 7 + [sin ® SA sin x] +- 


«=F [sin le F z) u — sin (n = z) u| ; 


which immediately reduces to sin (n+ 4)u, so that 


; + cosu-+cos2u+.---+cosnu = ae 
Let each coefficient in S,(#) be replaced by the integral 
expression which defines it. Since coska and sinks are 
independent of ¢, they may be written under the sign of 
integration, and the various integrals may be combined into 
a single one: 


%7] n 
S, (a) = . Lf F(t) E + & (coskteos ki} sinktsinkx)| dt 


a =1/" roofs + ase 


Bog sin(n +3) (t—2) | 
=i fifo 2sin $(¢— x) 


From the last expression may be deduced the following: 

Lemma. If f(x), of period 27, is bounded and integrable 

(in the sense of Riemann or in the sense of Lebesgue), if 
S(@)| < M 


for all values of x, and if Sn(x) is the partial sum of the 
Fourier series for f(x), to terms of the nth order, then 


oe 
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|S, (a)! < CMlog n, 


for all values of x and for all values of n>1, where C is 
an absolute constant, depending neither on x, nor on n, nor 
on the function f(a). 
By the hypothesis on f(x), 
POETA ME AT 


2 sin 4 (t — 7) 


If one-half the integral on the right is denoted by jn, its 
form may be modified by the substitution u = 4 (t— x), and 
by recognition of the fact that the resulting integrand is an 
even function of u, of period 7, to yield the conclusion that 


|du 


3 1 paw | sin (2n + 1)u 
m= p a A 
= J—(1+2)/2 sin y 


; du, 
2 —n12 SIn u i 


— 1 fm | sin (Qn+1)u lau _ pz} sin (2n+ 1) 
li | sin u 


2M. 
Sn (x) < wen Jn» 
From the fact that 
sin (2n +1)u 5 < OL 
Sm = 1 +È cos 2ku 
it follows that 
| sin (2n + 1)u | 


: < 2n+1 
sin = E 


for all values of u. On the other hand, |sin (2n + 1)u ~ 1, 
while 
sinu _ sin (æ/2) _ 2 t 27-1 


} m 


uy = g n sinu © 2 u 


throughout the interval of integration. So 


Jn = 1 ess) 2 [ent vant m pan du 
0 1/n 0 in u 


1 is 7 7 
= At z gS af > logn. 
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The last expression does not exceed a constant multiple of 
logn, for n = 2, and the conclusion of the lemma is justified. 

The method of application of the lemma, which is due to 
Lebesgue (though he did not make so extensive use of it as 
is done here), may be summarized in 

THEOREM IX. Zf f(x) is a continuous function of period 27, 
and S»(x) the partial sum of its Fourier series to terms of 
the nth order, n>1, and if there exists a trigonometric sum 
Tn(x), of the nth order, such that 


(FE —Tr@)| < en 
Sor all values of x, then, for all values of x, 
I(x) — Sp (x)| < Ben log n, 
where B is an absolute constant. 
The statement is equally true, though of less interest, if 
f(x) is merely assumed to be integrable; of course en can 
not approach zero, when the relations are considered for 


successive values of n, unless f(z) is continuous. 
Let the sum 7',(x) in the hypothesis have the expression 


n 
S + (a, cos kx + pr sin kz). 
á k=1 
It is found by direct integration that 


E Ptahi kn e A Tn(t) sin kt dt = Br, 
m 1 Taen 


for k< n, so that the partial sum of the Fourier series for 
Ta (£), to terms of order n, is identical with T,(x) itself. Tf 
the Fourier coefficients of f(x) are ax, be, those of the function 


Ryi(w) = fœ) — Tr(a) 


are d~— ex, by—Bx, for k <n, and the partial sum s, (a) 
of the Fourier series for R,(a), to terms of order n, is 


ae z big =a: Pe [(ar— ar) cos ka + (br— By) sin kx] 
=== Sh (x) se Tn (x) , 
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Since | R,(x)| < £n, it follows from the Lemma that 


\sn(v) < Cen log n. 
Consequently 


| F(a) —-Sn (x)| < | f(a) — Ti (x) | PMES 5, zs 
= | Ra (x)| + |sn(£)| < Ent Cen len Stee for 5 tee log n 


for n = 2, and the last expression has the form Be, log n, 


with B = (log 2)-'+C. 
The theorem may immediately be specialized and made 
more definite by combination with Theorems I—IV, as follows: 
Corouuary I. Jf 


\f (ae) — f @)| S 4\22—2 | 


Sor all values of x, and xz, 2 being a constant, then 


- Adlog nm 
n 


| f @)— Sa (x)| S 


CoroLLARY II. Jf f(x) is continuous with modulus of con- 
tinuity œ (ò), 
Qu 
S(@)—Sn(x)| < A o| = log n. 


COROLLARY Ila. The Fourier series converges uniformly to 
the value f(x), if f(x) has a modulus of continuity (0) 
such that limg—, (0) log 6 = 0 (Lipschitz-Dini condition). 

CoroLLARY MI. Jf f(x) has a pth derivative fP (a) such 


that ; 
|SP (a) —f' (a4)| < |ts — t | 


Sor all values of x, and xz, 4 being a constant, then 


ApŻlogn 
ss 


| F (@)—Sn (x) 


IA 


CoroLLARY IV. Zf f(a) has everywhere a continuous pth 
derivative with modulus of continuity œ(ò), 


\ f(a) —Sn (x)| < sf o(?7) logn. 


www.rcin.org.pl 


I. CONTINUOUS FUNCTIONS 23 


In each of these statements, the conclusion holds for all 
values of x, and for all values of n > 2; the coefficient A is 
an absolute constant, while Ap depends only on p. 

The corollaries are stated separately for emphasis; it is 
clear that all are included in Corollary IV, if it is under- 
stood that p may in particular have the value 0. The use 
of the same A in I and II, and of the same Ap in III and 
IV, signifies merely that when two constants are concerned, 
one symbol may be used to represent the larger of them. 

It is worthy of note that even the dependence of A, on 
p can be eliminated, with a resulting simplification in Cor- 
ollaries ITT and IV which does not have a counterpart, as 
far as the present evidence goes, in the case of the corre- 
sponding third and fourth Theorems. 

If f(x) is a function of period 2a having a continuous 
derivative, the Fourier series for f(x) is that obtained by 
formal differentiation of the Fourier series for f(s). This 
is recognized without any further assumption as to the con- 
vergence of the series, from the relations 


OL ar 

i „SO coskiat = K já SO sinktdt, 
wT mt 

Fro sin ktdt = —k {ro coskt dt, 


which are obtained by integration by parts, the terms which 
would appear outside the integral sign reducing to zero, 
because of the periodicity of the functions involved. If the 
coefficients in the series for f(x) once more are ax, bk, and 
if f(z) has a continuous derivative of order 2q, where g is 
any positive integer, the Fourier coefficients for f°” (x) are 
uk = (— 1) kt ar, Be = (—1)4 k bk, Which means that the 
series for f(x) can be written in the form 

llo 1 : 

> +(—1)@ 2 Jea («r cos kx -+ r sin kz). 
The partial sums of the series for f(x) and for f°? (æ), to 
terms of the nth order, may be represented by S, (x) and 
SE? (x) respectively. 
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Let 
YA tiad (2) — sy” (x) = on (2), 


and suppose now that |g, (x)| < én, where é 41 < £n for all 
values of n that are considered, and lim,—x £n = 0. Then 


ay COS kin -+ Be Sin ke — op—1 (£) — ox (x). 


By any of the preceding corollaries, Ila, for example. the 
series for f(x) converges to the value of the function, while 
the series X oxı (x)/k*4, X oy (w)/k*4 are convergent, because 
ox (x)| is uniformly bounded. Hence the remainder f(a) — Sn (a) 
can be written and rearranged as follows: 


H 
Fah — Seen (1. AEE TEE EE 
knp Ka 


= (—1) è > i [ex—1 (£) — ex (2)] 


=n+1 


= | — ar ee 


as (a T 7) Qk | . 


As the parentheses in the last summation are all positive, 

and as |ọx(x)| < ek < & for k > n, 
Fla) — 84(@)| < ete S a | 

"= (n+1)% kna kb (k41) 

we Zen << 2 

(n-+ 1)4 ma * 
Under the hypotheses of Corollary IMI, let p = 2q if p is even, 
p = 2q-+1 if p is odd. Then Corollary I or Corollary MI 
may be applied directly to f°” (x), which is found to satisfy the 
requirements of the preceding paragraph, with £n —(A/logn)/n 
in one case and £a = (A,/4logn)/n® in the other. The 
corresponding values of 2¢é,/n™4 are (2AAlogn)/n?** and 
(2 A; å log n)/n?*? respectively. For literal accuracy. it must 
be admitted that (log »)/n diminishes only from n = 3 on, 
and that the conclusion has been established, when p is even, 
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only for n = 3. The value n = 2 may be included if the 
previous £ is replaced by a somewhat larger quantity, with 
a suitable adjustment of the constant in the conclusion, if 
necessary. 

Similar reasoning may be employed under the conditions 
of Corollary IV, except that it is no longer satisfactory to 
take 2q =p when p is even, as (2a/n)logn might not 
decrease with increasing n. It is sufficient, however, to let 
p = 2q+2 when p is even, p = 2q +1 when p is odd, and 
to obtain £n by the application of Corollary IV to f° (x) 
as thus defined. From the definition of w(ð) it is certain 
that »(2/n) itself diminishes, or at any rate does not 
increase, as n increases. 

All the cases in question are covered, with some redundancy, 
if the following are considered successively: for œ(0) < 20, 

= 0, 1, 2q (q = 1), 2q+1 (q =1); for general œ (ð), 
p= 0,1,2, 24+1 (421), 2¢+2 (421). Ifa single letter 
is used to represent the largest of the finite number of con- 
stants entering into the corresponding conclusions, the result 
may be formulated thus: 

THEOREM X. The preceding Corollaries INI and IV may be 
restated, for all values of p Z0, with the multiplier Ap replaced 
by an absolute constant D, depending neither on p nor on 
anything else. 

4. Degree of convergence of Legendre series 


A considerable part of the above reasoning can be carried 
over to the case of Legendre series, though the relations 
are less simple than for Fourier series, and the results as 
presented here will be less complete. 

By the Legendre series for a given continuous function f (s) 
is meant a series of the form 


ay Xo (x) + ay Xi (x) + ag Xo (x) +- 
where Xx(æ) is the Legendre polynomial of the th degree, and 


PKL f! SO Xat 
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Not to work out the theory of these polynomials from the 
beginning, it will be assumed as known that one of them 
is defined for each integral value of k > 0, the first two 
being Xo = 1, X, = z; that they satisfy the relations 


1 a 2 
, 7 = -47 72 = es 
fixo AdE = 0 AE, f xou IF] 
that any successive three of them are connected by the 
recursion formula 
(k +1) Xet (x) — (2h +1) a Xe (x) + kX- (x) = 0; 


and that the polynomial of the kth degree can be expressed 
in the form 


7t 
Xr (æ) = + f: [z+ i(1 — 2)" cosg)" do. 


In the last expression, the presence of imaginaries is only 
superficially apparent; if the integrand is expanded by the 
binomial theorem for a positive integral exponent, the co- 
efficient of each odd power of 7 after integration is an integral 
which is seen at once to be equal to zero. 

Let Sa(x) stand for the sum of the first »-+1 terms of 
the series: 

Sn(x) = do Xo (£) +4, X, (x) ++ - -+ an Xn (a). 


By the definition of the coefficients, 


Sua) = f SO Kn, Oat, 


where 
Kn (x, ) = ; [Xo (x) XAH (w) M(H+ -- 
7+. +(2n+1) Xn(x) Xn W]. 


This function can be rewritten in the form 


Xni (æ) Xn(Q—Xn(@) Ana (À) 


ar—t 


Ky (x, t) == dani 
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the identity is immediately verified for n = 0, and may then 
be proved in general by a straightforward process of induction., 
based on the recursion formula. 

In the integral expression for X(x), when x <1, 


£ + i(l = FA cos Y } = [x* + (1 — z’) cos? A a 
akpi = 1, 
and hence X;(xv) <1. There will be occasion to use a closer 


inequality for X;,| in the interior of the interval (—1, 1). 
To return to the integral representation, 


oo i 
Xv) < 1 | [z+ (1 —3*) cos” g] do 
Jo 


2 m/2 
= [2? + (1 —z*) cos* gp}? dg, 


IT 0 


the last equality resulting from the fact that cos* p = cos? (r — g). 
Since 


x? -+ (1 —.*) costo = cos*g + 2° sing = 1—(1—2") sin? y, 
and since sin y = 2/7 throughout the interval of integration, 


while 1—2* is positive for the values of x under considera- 
tion, it follows that 
ae | 


a? -+ (1— x?) cos? gy < 1— 4 (1—x*) g = 1— $*g?, 


if ë = 2(1—.*)'*/a. By an application of the extended 
mean value theorem to the function e”, 


eu — 1— y+ z yee, OEE e e 


so that eY >1—y for all real values of y, and in the 
present connection 


So 


2 el2 2 x 
Xe(n) | < =a Pride 2 J eerdg, 
T Jo T J0 
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and, by the substitution u = (4K)? E, 


Ci 


es ak ae 
| Xe (a) | S a J, edu = Taq—aye >O. 


where ¢, is independent of k and x; the numerical value 
c& = (2/2)? is not essential for present purposes. 

From the relation just obtained it follows, first, that if æ 
is restricted to an interval — 1+ $y<2—1—}9,0<9 <2, 
then 


Xi (x) | < aA , 


where g is independent of k and æ, but depends on 4; and 
secondly, that 


‘ A dx Ca 
le | Xx (x) | dz < wa mye — jaa? 


where cə is independent of k. 

It is possible now to proceed to the proof of the following 
lemma, which assigns an upper bound for |S» (x)|, not through- 
out the entire interval —1 < æ < 1, to be sure, but through- 
out an interval interior to it: 

Lemma. If f(x) is bounded and integrable (in the sense 
of Riemann or in the sense of Lebesgue) for —1 <a <1, if 


S@)| < M 


throughout the interval, and if S,(x) is the partial sum of 
the Legendre series for f(x), then 


Sa (x)| < GM logn 


Jo —1 +y Sr <1— nr, 0<4<1, where G does not depend 
m x, n, or the function f(x), but does depend on n. 
The proof starts from the fact that 


OE uf, | Kn (a, )| dt. 


Let the interval of integration be divided into five sub-intervals, 
terminated by the points 


1 1 1 
<2), ST a Ae Ae n’ I-34 1. 
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Under the hypothesis that — 1 -+ < x < 1—y, these points 
will succeed each other in the order named, as soon as 
n>œ>?2/ņn. Tt will be assumed for the present that this con- 
dition is satisfied; then, in particular, n +0, and as there 
will be occasion to observe incidentally, [(n + 1)/n] 2 <2. 
Let the values of the integral of | Kn! over the sub-intervals be 
denoted by J,.---,Z, respectively. The relation |X; (8| < g/k"? 
can be used in the second, third and fourth integrals, and 
the relation | X;({)| < g/k"*, which is independent of ¢, in 
any of the five. In the middle interval, 


K, (æ, )| = |= > (2k + 1) Xr (£) Xr (0) 


att x1 Í 
sitie 


n 
<1+ D> 2g? = 1+2g'n < (29°+1)n, 
k=1 
and hence 


etl /n 
l, = ih fee | Kn (a, £) | aa. 2g + 1)ndt = 2 (28+ 1). 


-1/1 


The representation of K,(a, t) as a fraction is to be used in 
the remaining integrals. In the first interval, a—t| > $4, 
and 


n g [Kn (a, t)| at 
atin 
AR S Xn (x)| f; | Xn(t)| dt 


14.4 
+ | Xn (x) J | Xn+1 (2) | d (| 


$e faaola] 


ee ck 
aera (cosy eC 


rt, yk 2ceg 2 4esg 
= a hate 9 
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In the same way, /;<4¢.g/y. In the second interval, 


1 5 
AEL Ynti (a) Xa ()— Xn (2) Xapi (0) 
c nt! 29° PAA 
AE A ETN ~ °9" 
so that 
»x—1/n ax—ljn 
l aij Sp Kæ ojat<ag* | dt 


1 g 
+9 Be aes t . 


or, by the substitution «—/ = u, 


Ho du a 
n<ag f 77 cage fat — 2g dog 2 + log n). 
Sijn u Iin U 

Similarly, 4, < 2g” (log 2 +log n). As n is an integer satis- 
fying the condition n >?2/}>2, it is certain that n > 3. 
lognœ>1, and the inequalities that have been obtained for 
lı, ***, Js will merely be strengthened if the factor log » 
is inserted in the right-hand members wherever it does not 
occur. By combination of these inequalities, 


1 
JS (Kn (x, ldt < G; log n 


for n> 2/n, the number G, depending only on y. For each 
value of n belonging to the range 2 < n < 2/ņ, the integral, 
considered as a function of x for — 1 +4 <x <1—y, has 
a maximum value. Let Gs be the largest of the finite 
number of maxima thus determined. Then G depends only 
on 4, and the statement of the lemma is true for all values 
of n > 2, if G is taken as the larger of the numbers G,. 
G./log 2. 

For the application of the lemma, it is to be noticed that 
an arbitrary polynomial of the nth degree can be expressed 
identically as a linear combination of NX (%), +, Xn (œ). 
with constant coefficients. Then a process of reasoning 
which is entirely analogous to that used in the proof of 
Theorem IX, and which need not be repeated at length, 
serves to demonstrate 
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THEOREM XI. Jf f(x) is a continuous function for 
—1< x < 1, and Sn (x) the sum of the first n+ 1 terms 
of its Legendre series, n>1, and if there exists a polynomial 
Pa (x), of the nth degree, such that 


|S (x) — Pu (a)| S En 
for —1 <x <1, then, for—1+q4 Sz < 1—7 (>00), 


| f (7) — Sp (x)| < Hen log n, 
where H depends only on y. 

The more specific results obtained by combining this prop- 
osition with Theorems V—VIII will not be formulated sepa- 
rately, but will be summarized in a single 

COROLLARY. If f(x) has a continuous pth derivative (p => 0) 
for —1 <x < 1, with modulus of continuity (0), and if 
Sn (x) is the sum of the first n-+1 terms of the Legendre 
series for f(a), then 


H, 2 
4 — Sn T ~ ? ) 
fæ) (x) oP al P logn, 


for n>p and for —1+y Sx <1—n, where Hp de- 

pends on y and on p, but not on anything else; in parti- 

cular, the series converges uniformly to the value f(x) for 

—1+y7 52 < 1—7, if fŒ) itself has a modulus of con- 

tinuity o(ð) for —1 <s <1, such that dim (0) log d = 0. 
=0 


Since 7 may be arbitrarily small, the last conclusion implies 
that the series is convergent, not necessarily uniformly, 
throughout the open interval —1<27<1. 

Gronwall(Mathematische Annalen, vol. 74 (1913), pp. 213-270; 
Transactions of the American Mathematical Society, vol. 15 
(1914), pp. 1-30) has shown essentially that 


1 
T Kn(x, t) ldt < hon’ 


for —1 <x <1, where Go is an absolute constant. It follows 
that conclusions analogous to those of the preceding theorem 
and its corollary hold for the entire closed interval (—1, 1), 
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if the factor logn is replaced by n'*, the corresponding 
sufficient condition for uniform convergence being that 
limy—» (0)/6* = 0. But the proof of the relation of in- 
equality for the integral appears to be rather long, and will 
not be set forth here. From the definition of Kn, together 
with the facts that |X; (#)|— 1 for —1 <x <1 and that 
fix dt < ¢s/k?, it is obvious that 


a1 n 
3 1 : ale ja 
J Ky(x, t)\dt © 1+ 2 = (2k +1) c/ k< cy ne’, 


where cs is an absolute constant; and it can be inferred at 
once that the series converges uniformly to the value f(z) 
for —1—a2<1, if f(x) has a first derivative with a modulus 
of continuity «e (ò) such that limy—, #(0)/d’? — 0, while 
there are corresponding theorems on degree of convergence. 
In a later chapter, conditions will be obtained which are 
closer than those thus indicated, though not so close as the 
ones corresponding to the factor n°. 

From the discussion of approximation in terms of polynomials 
and trigonometric sums it is natural to pass to similar questions 
with regard to developments in series of more general functions, 
particularly the characteristic functions defined by linear differ- 
ential equations with boundary conditions. The beginnings 
of such a theory have been presented by the author (Trans- 
actions of the American Mathematical Society, vol. 15 (1914), 
pp. 439-466) and W. E. Milne (the same Transactions, vol. 19 
(1918), pp. 143-156). The present account, however, will 
be continued along other lines. 
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CHAPTER IU 


DISCONTINUOUS FUNCTIONS; FUNCTIONS OF LIMITED 
VARIATION; ARITHMETIC MEANS 


Introduction 


The discussion hitherto has been concerned almost entirely 
with uniform convergence, and with functions that are con- 
tinuous throughout the interval under consideration. Corres- 
ponding theorems with regard to the approximate represen- 
tation of discontinuous functions are naturally less simple, 
and perhaps of less immediate interest. It will be well, 
nevertheless, not to disregard entirely the question how far 
the theory that has been outlined can be brought to bear 
on the representation of such functions. This question will 
occupy the present chapter. (For a somewhat different set 
of theorems on the approximate representation of discontinuous 
functions, reference may be made to a paper by C. E. Wilder, 
in the Rendiconti del Circolo Matematico di Palermo, vol. 39 
(1915), pp. 345-361.) There will be occasion also for the 
further development of the theory as applied to continuous 
functions, by reference to the concept of limited variation, 
and by study of the summation of Fourier series according 
to the method of the first arithmetic mean. A beginning 
will be made by a review of some well-known general theorems 
about Fourier series, 


1. Convergence of Fourier series under hypothesis 
of continuity over part of a period 


Let f(x) be a function of period 27, which is summable 
over a period, together with its square. (Without change in 
the form of the argument, the discussion can be kept elementary 
for the present, with results which still have a high degree 

` 33 5 
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of generality, if it is assumed instead that f(x) is bounded 
and integrable in the sense of Riemann.) Let 


1 7E 1 an 
=p jar fib cosktdt, bk = om F(t) sin kt dt, 
e-n 


2 


Kæ = + Du cos ka -+ bp sin ka). 


Then, as may be verified by multiplying out the square and 
integrating term by term, 


L f" Foso 
E vl. DOLS 2 f(x) Sq (a) dae+ Lf [scold 
sif [Fe] dz—2 ® 4 Saiti] 
++ E a+] 
=1(" yora—[S+3 a+]. 


As the first member can not be negative, it must be that 


@ 4 Sati < a [ode 


for all values of n, and hence that the series 
as RE EP 
y+ & (ae + bi) 


is convergent. It follows that for any function f(x) of the 
character specified, the coefficients ax, be approach zero 
as k becomes infinite. The identification of the sum of the 
series with the value amfi JON dx is not needed for 
present purposes. 

The assumption that [/(z)]* is summable is not essential 
to the truth of the conclusion. On the hypothesis that f(x) 
itself is summable, let fy(®) be the function which is equal 
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to f(x) when | f(2)| < N, and equal to 0 when |f(x)| >A 
Let ax, bx be the Fourier coefficients of f(x), and ary, dew 
the corresponding Fourier coefficients of fy(x). It is known 
from the earlier work that limy— x dkv = 0, limp—x bey = 0, 
for fixed N, since fy(x) is bounded. Let ¢ be an arbitrary 
positive quantity, and let M be taken so large that 


t Wows lapel 

=f IAA) toys 
The hypothesis of summability implies that such a choice 
of N is possible. Then 


7 
: | | [LA (2) — fry (x)] cos ka da < = E, 
e a á 


a 
NA a kolde Ss 
ory a 2 
for all values of k, and hence 
| dke— arn | < z &, |br—bey| S > £. 


But for the particular value of N in question there is a ko 
such that |arn| < $e, |bren| < de, for k > ko, and for 
values of k subject to the last condition it follows that 
|ar| < €, |b| < e. If f(x) is a summable function of period 
2m, and if ax, be are its Fourier coefficients, 

lim a, = 0. lim by = 0. 

k=% k=% 

Now let f(z), still summable and of period 27, be supposed 

to vanish identically for a—y < x < m+y, O<y<a. 
As was shown in the first chapter, 


Ba (r) =} fire yeu Moe ay 


2 sin 4 (t— xo) 


Let ¢—a = u; inasmuch as the integral of a periodic 
function over a period is the same, wherever the initial 
point is taken, the limits —a, æ% may be retained after the 
substitution: 


a 
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Sy (aa) = + j: fet OTD" gu, 


2 sin du 


or, after expansion of sin (nu -+ 4u) by the addition theorem, 


7 


Sa (2) = = f SF (a +u) cot 5 u sin nu du 
IE eJ t 


7E 
+ a fi FS (ao +u) cos nu du. 
Or Von 


The factor cot 4« is bounded and continuous over the range 
of integration, outside the interval where / (xo + u) vanishes 
identically. Hence (2+ x) cot}u, like f(a + wu) itself, 
is a summable function (or a bounded function integrable in 
the sense of Riemann, if this hypothesis was originally im- 
posed on /f(x)), and the reasoning of the preceding para- 
graph is directly applicable to show that both integrals in 
the last expression approach zero as n becomes infinite; 
that is, 

lim Sn (£o) = 0. 

n= 0 
More generally, if f (x) and g(x) are two summable functions 
(or two functions satisfying the alternative hypothesis), if 
Sa (x) and sa (x) are their respective Fourier sums of the 
nth order, and if f(x) and ọ (x) are identically equal for 
zo—n < x < rotnn, then 


lim [Sn (a9) — Sn (£o)] = 0; 

n=% 
if the Fourier series for y(x) converges for x = x to the 
value (ao) = f(a), the Fourier series for f(x) does the 
same. This is commonly expressed by saying that the con- 
vergence of the Fourier series for a given function at a speci- 
fied point depends only on the behavior of the function in the 
neighborhood of the point; it is tacitly understood that only 
functions of some specified class are considered, as, in the 
present connection, summable functions, or functions which 
are bounded and integrable. 
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It is possible then to pass immediately from Corollary Ha 
of Theorem IX in the first chapter to the following statement: 

THEOREM I. Jf f(a) is a summable function of period 2x (or, 
less generally, a function of period 27 which is bounded and 
integrable in the sense of Riemann), and if there is an inter- 
val (x»— n, X +4), O<yn<a, throughout which f(x) is 
continuous, with a modulus of continuity œ (0) such that 
limy—, @(0) log ð = 0, then the Fourier series for f(x) con- 
verges for « = x to the value f (xo). 

For a periodic function (x) can be defined as equal to 
fæ) in (a —y, 7+), and linear, say, from zot q to 
vq —y-+2a, and this g(a) will have a modulus of continuity 
satisfying the requirements of the Corollary cited. 

Attention will next be directed to the simplest case of 
convergence at a point of discontinuity. Let f(x) now be 
a function which has a “finite jump”, or discontinuity of the 
first kind, at the point £ = 2, approaching limits which may 
be denoted by /(a%—) and f(a +) as x approaches zo from 
the left and from the right respectively. Let it be supposed 
that the values of f(x) in the interval a»)— y < £< zo, to- 
gether with the value /(a)—), form a continuous function 
with modulus of continuity »,(0), and that f(x) is likewise 
continuous, with the other limiting value at 2), in the interval 
from zo to zot, the modulus of continuity this time being 
w (8). For each value of ð, let (6) represent the larger 
of the numbers w, œ. It will be said briefly that f(x) is 
continuous for %—n < x < x+y, with modulus of con- 
tinuity (0), except for a finite jump at the point x». It is 
understood always that f(x) is summable over a period. If 
limy—, (0) log ð == 0, Theorem I establishes the convergence 
of the Fourier series for f(a) at all interior points of the 
interval (%— 4, zo+ 4), other than 2), and it remains to 
consider the question of convergence at the point 2» itself. 

For this purpose, it may be assumed without loss of gene- 
rality that a — 0. For the terms of the series which involve 
cosna and sinnw, taken together, are identical with the 
corresponding terms in the development of f(x) as a function 
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of x—2. In formulas, let z— to = y, f(x) = (y), 
t—2 =u. Then u—y = t— zr, and 
; n 
cos ny ie o (u) cos nu du -+ sin ny f”, o (u) sin nu du 
èt 
— F g (u) cos n (u — y) du = fro cos n(t— x) dt 
t 7 
= cosn zf so cos nt dt + sin nx J” FO sin nt dt. 


The assumption that a) = 0 is therefore equivalent to a change 
of variable which does not affect the conditions of the problem. 

With this understanding, let the symbols f(v-++) and f(a —) 
be replaced by f(0+) and f(0 —), let 


A(z) =f(2) O<2<a), Aa =Al(—a) (—a<2r<0), 
Ji (0) = f+), 


and let f2(x) similarly be an even function identical with f(x) 
for —a =< «<0, and taking on the value f(0—) for x = 0. 
Let Sm (x) and Sne (x) be the partial Fourier sums for f(s) 
and f2(x) respectively. The function fi(x) is continuous 
throughout the interval — y <x <4. If x, and zy are two 
numbers belonging to this interval, and if |a.—2z,| < ð, 


ht (xy) =Y (2)| < (0d) 
if x, and xə have the same sign, or if one of them is zero, and 
Ai (@2) —fi (m1) | S| fi es) —A (0) + Ai (0) — fi (a1) | < 2 @(9) 
if xı and zə have opposite signs. Similarly, f.(~) has a 
modulus of continuity < 2@(d) for — q < x < q. Tf it is 
supposed that limy—, (0) log d = 0, 
lim Sai (0) = fi (0) = f0 +), lim Sne (0) = A0) = f(O—), 


by Theorem I. But 
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771 K ` 9 
Sni (0) = LP A eT = * 2° pp sna sin(n + 4)t y 


2sin}t 2sin}t t, 
sin(n-+4)t dt =2 f oT EPa, 


— Qsindt | 


i 
soif ro a dt =} [Sa (0) +S), 


Sna (0) = = S E A 


and PEN ASA 


tim S, (0) = =- L [foH+ro—). 
The conclusion may be appended to the theorem as 

COROLLARY I. Zf f(x) satisfies the conditions of Theorem I, 
except for a finite jump at the point xo, the series converges 
at xo to the value 4[ f(a +) + flo —)]. 

So far, nothing has been said about uniformity of con- 
vergence. To lead up to a discussion of this topic, the 
following lemma will be established: 

Lemma I. Zf f(x) is an arbitrary summable function, the 
integrals 


y *y 
Arly) = f” A0 cos ktdt, Br (y) = f” 0 sin kidt, 


approach zero, as k becomes infinite, uniformly for —n <y Sr. 
If f(x) is bounded and integrable in the sense of Riemann, 
the proof holds without change of form when the integrals 
are thought of as Riemann integrals. 
In the first place, it is clear that 


limy:— w Ax (y) = limy— x Bk (y) = 


for any fixed value of y in the interval. For Ax (y)/a and 
Br (y)/« are the Fourier coefficients of a function w(x) which 
is equal to f(x) for —a <x <y, and equal to O for 
y<x < rm. Lete be any positive quantity. Let 


Ay) = J" Olat 


This function is continuous for —m < y < a, and so 
uniformly continuous. Let ð be a positive number such that 
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r sal 
A(y2) — Ayı) < gi 


if y and ys are any two points in (— m, mæ) for which 
ly2—%|<6. Let N be an integer, for definiteness the 
smallest integer, such that 27/N<4, and let z; = —a+(2ja/N), 
j =0,1,---,N. By the remark made above as to the 
behavior of A; (y) and By, (y) for fixed y, there is for each 
value of j a number i; such that | Aj (z)) < $46, | Be(z)|<4e. 
for k > kj. Let k' be the largest of the numbers ko, «++, ky. 
If y has any value in the interval (— 7, 7), there is a 2; 
such that O < y—2z;<.d. Then, for any value of k, 


Ar (y) — Ar (zj) = ro cos tat eile \f(b\dt 


TI 

AWA) <5 E, 
fo 

| Br (y) — Br (z)| < -5€ 


Hk > kK, 
Ax (y)| < | Ax (z) + | Ax (y) — Arz) <e, | Be(y)|<e. 


This is equivalent to the conclusion of the lemma. 

The existence of the period 27 for f(x) being understood 
throughout, let the definition of Ax(y) and By(y) be extended 
to all real values of y. For any yı and ys, 


Ys 
f Sb cos ktdt = Arly) — Ar). 


Here and subsequently, each relation written down for A, 
has a counterpart involving By. Let ex be defined for each / 
as the larger of the maximum values attained by Ax(y)|, 
| Bx(y)| in (— 7, a). In particular, | Az (7)| < ex, | Be(a)| < ex. 
The lemma just proved is equivalent to the statement that 
limx=% €k = 0. Because of the periodicity of f(t) cos kt 
and f(t) sinkt, the integrals defining Ax(y) and B,(y) are 
not altered in value if both limits of integration are increased 
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or diminished by the same integral multiple of 27. For 
example, if y is in the interval (æ, 37), 
| Ar(y) — Ak(7) < Ek, 
whence it follows that |Ax(y)| < 2er in this interval. If y 
and ye are anywhere in (— 7, 32), the corresponding values 
of Ar will differ by not more than 4¢;,. Finally, if yı and yə are 
any two numbers subject to the condition that |y,—y| < 27, 
there will be an integer j (positive, negative, or zero) such 
that y,+2j% and yo+2ja belong to the interval (— 7, 37), 
and 
| Ax (ys) — Ax (ys) < fek. 

A similar relation holds for By. 

The purpose of these details is to bring out a fact which 
will be used presently. For any x, and for any y in (— 7, 7), 


ee S(t) sinn(t{—ax)dt 

O fot a a 

=\cosna[ By(a+-y)—Brla—a)] —sinna[An(a+y)—Anle—)] 
=> Bên, 


or, by the substitution {—a = u, 
Y 
| f ftw sinnudu| — Ben 


for arbitrary a and y, subject to the condition that — æ < y 
<æ. (It is clear that the interval from — æ to y may be 
replaced by any other interval of length < 27.) 

The next stage of the discussion may be summarized in 
another lemma: 

Lemma Il. Jf f(x) is a summable function which vanishes 
identically for «e—yi £< B+y, with 4>O0, its Fourier 
series converges uniformly to the value 0 for a <z < B. 

The proof will be expressed in terms of the Lebesgue 
theory of integration, and, unlike those that have gone before, 
would have to be appreciably modified in form if reference 
to that theory were to be eliminated. A more elementary 
method of proof appropriate to the case of functions bounded 
and integrable in the sense of Riemann will be indicated later. 
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Let x have a value belonging to the interval œ < x< 2. 
By the usual formula, 


. = sin(n + 3) (¢—2) 
Sale) = if S(O—9 sing @— 2sin}(t—a) e 


— af £0 cot 5 (ta) sin n(t —.x)dt 


+ 5 f fÐ cos n(t—a)dt. 


The second of the two terms making up the last expression 
is equal to 4(a, cosnæ-+ bn sinnx), and approaches zero 
uniformly for all values of 2, as an immediate consequence 
of the fact that a, and bn approach zero. 

In the first term, let /—a =— u. Then the integral is 
equal to 


Tyla) = vb 2 S(ia+tu) cot usinnu du. 


Let C(u) be a function of period 2% which is equal to 
cot su for —a <u <—y and for qy <S<u<n, equal to 0 
for —}7 <u < $n, and so defined for — n < u < —}y and 
for $y < uy as to have a continuous derivative every- 
where; it is scarcely necessary to write down an explicit 
formula for the construction. As f(a-+u) is identically zero 
for —y Su <n, when z is in («, A), the integral J,(z) is 
the same as 


T 
fre +u) Clu) sin nudu. 
Let 


f Se+wsinnudu = Fy); 


the values of n and x are for the moment to be regarded as 
fixed. The function F(y) has almost everywhere a derivative 
equal to f(x+y)sinny. Consequently, as C(u) is differ- 
entiable everywhere, the function F(u) C(u) has almost 
everywhere a derivative equal to F’(w) C(w + F(u) C’(w). 
Furthermore, F(u) is absolutely continuous, and C(u), having 
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a bounded derivative, is absolutely continuous also. Hence 
F(u) C(u) is absolutely continuous, and its increment over 
an interval is equal to the integral of its derivative over 
the interval. That is, in particular, 


F(a) C(a) — F(—a) C(—a) 
=f" F' (u) C(u) du + F F(u) C' (u) du, 
In (x) = Je F' (u) C(u) du 
= F(a) C(a)— F(—a) C(—a) — (ffs Flu C hò) dus 


But C(a) = C(—a) = cot (+47) = 0. By an earlier 
paragraph, | F(u)| < 84, for —a <u <m, where é, is in- 
dependent of z; if p is the maximum of the continuous func- 
tion |C’ (u)|, which is independent of » and n, 


lone) = |f” ro C'(u) du < 167K eq. 


So Jn(z) approaches zero uniformly for the values of x in 
question, Sa (x) does likewise, and the lemma is proved. 

It follows at once that if f(x) and y(x) are two summable 
functions, identically equal for «—y Ss +y, and if 
the Fourier series for ọ (x) converges uniformly to the value 
g(x) = f(x) for x < x < B, the series for f(x) does the same. 
In particular, Theorem I may be further supplemented by 

Corouuary I. Jf f(x) is a summable function of period 21, 
and if there is an interval œ —y S< £s < B +n, 4 >0, through- 
out which f(x) is continuous, with a modulus of continuity œ(ð) 
such that limg—, (0) log ð == 0, then the Fourier series for 
J (x) converges uniformly to the value f(x) for a Sx s< B. 


2. Convergence of Fourier series under hypothesis 
of limited variation 
As the next pages will be concerned largely with functions 
of limited variation, the insertion of a proof of the theorem 
about the convergence of the Fourier series for such a function 
is not out of place. 
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Let f(x) be a function of period 22, having limited 
variation over any finite interval. Since a shift of the 
origin from which x is measured does not change either the 
character of the function or the terms of its Fourier series, 
when sine and cosine terms of like order are taken together, 
it is sufficient to consider convergence for s = 0. 

Since f(x) is of limited variation, its discontinuities, if any, 
are finite jumps. Let w(x) be a function of period 2a which 
is linear for O<a@< 2a, and further defined by the con- 
ditions w(O+) = f+), WO—) = w2a—) = f(0—), 
w(0) = 4[f0+)+/(0—)]. If f(z) is continuous for s = 0, 
w(x) is merely a constant. If f(x) is discontinuous for s = 0, 
w(x) is equal to the constant $[/(0+)+/(0—)] plus an 
odd function, and its Fourier series consists of this constant 
plus a series of sine terms, all of which vanish at the origin. 
So the partial sum of this Fourier series not merely approaches 
w(0) (in accordance with Corollary I above) but is always 
exactly equal to w(0) As f(z) = y (a) + [f@)— yo], 
the problem of convergence for f(x) reduces immediately to 
the corresponding problem for the difference /(#)— w(x), 
which is continuous at the origin, if f(x) is defined for 
x= 0 as equal to the mean of its limiting values. There 
is no loss of generality therefore in assuming at the outset 
that f(x) is continuous at the origin and: vanishes there. 
This assumption will be made henceforth. 

By the hypothesis of limited variation, f(x) can be ex- 
pressed for —a# <s <w in the form f(a) = g, (x) — pe (2), 
where y, and gs are bounded and monotone increasing 
throughout the interval, are continuous wherever f(x) is 
continuous, for «== 0 in particular, and vanish for « = 0. 
For the study of the expression 


Sn (0) = 5 jie po SMT y, 


2 sin a 
let 


wt 
Aiey sin(n+4)t 
jan J, 1 Sina 2 sin T dt, 
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let jıs be the integral similarly formed with pə in place of 9, 
and let js; and jes be the corresponding integrals from — ~ 
to 0. Then S, (0) = (1/4) (jur—jiz +21 — Jee) 

If & and § are any two numbers belonging to the interval 
(0, a), and if n is any positive integer, 


` sin(nt+Ht p| — 
J zn o e9: 


For the integrand is alternately positive and negative over 
intervals of length 7/(n-+- 4), the arches of its graph dimin- 
ishing steadily in height from left to right. The integral 
over an interval corresponding to any number of whole arches 
is therefore a sum of diminishing terms of alternate signs, 
and its magnitude does not exceed that of the largest term, 
namely the area of the first arch involved. The integral 
from any §, to any & is made up at worst of such a sum, 
plus the integrals corresponding to parts of two other arches. 
But the integrand, being equal to -+ cos t+ --- + cosnt, 
never exceeds n+4 in absolute value, and the area of any 
one arch, with base a/(m-+-4), can not exceed m. So the 
magnitude of the integral from §, to & can not exceed 37. 
(It is almost as easy to see that the value is actually less 
than 27, and a still lower bound could be obtained, but the 
inequality as written is sufficient for the purpose in hand; 
the essential thing is that the right-hand member is independent 
of &,, &, and 7:) 

If 0<d <E < & < a, the number ò being regarded 
as fixed, 


i sin(n+4)t | 3™eg 1 


~ = s Cr = = . 
g 2sin tt =n+4 33 2 sin 40 


For the absolute value of the integrand never exceeds cy, and 
the magnitude of the integral can not exceed the sum of 
the areas of three arches, each of base a/(n-+- 4) and of 
height not greater than cy. 

Let € be any positive quantity, and let ð be chosen (by 
virtue of the continuity and vanishing of y, at the origin) 
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so that p, (0) < «/(6a). The integral defining jıı may be 
taken as the sum of the integrals from. 0 to ð and from ð 
to æ. The second law of the mean may be applied in each 
of these integrals. For that from 0 to 6 it gives 


WwW 
J (fee anes e TOR N SAA Taak LFI 
0 


2 sin $4. 2 sin $¢ 


where § has some value in the interval (0, 6); by the second 
paragraph preceding, the absolute value of the integral on 
the right does not exceed 3, and consequently that of the 
integral on the left does not exceed }¢, for any value of n. 
For the rest of J, 


ix A M pi P A of sin (n+3)t yy 
su 


2 sin 2 sin $f, 


+ gı (7) $ Gr: Bt dt, 


2 sin $¢ 


the new € being in the interval (ð, 7); here g, (0) < g, (7), 
and the absolute value of each integral on the right, 
by the preceding paragraph, is less than or equal to 
3acy/(n+ 4). So 


úa sin she Uy a 


which is less than $e as soon as n is sufficiently large. This 
means that j,,; approaches zero as n becomes infinite. By 
similar treatment of fıs, jo:, and jes, it is recognized that 
S,(0) approaches the value 0 = /(0). 

For functions having the original degree of generality, 
the result may be stated in the following form: 

THEOREM II. Jf f(x) is a function of period 2 having 
limited variation over a period, its Fourier series converges 
to the value f(x) at every point where f(x) is continuous, and 
to the mean of the values approached from the right and from 
the left at every point where f(x) is discontinuous. 

If f(x) is discontinuous at the origin, the details of the 
above calculation apply properly not to f(x) itself, but to 


IA 
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the difference f(#)— w(x), the retention of the symbol f(z) 
having been equivalent to a change of notation. Let it be 
supposed now that f(x) is continuous throughout. Then the 
complication just mentioned does not arise (except for the 
subtraction of a constant to make f(0) = 0, which is of 
no consequence). The functions ,(%) and pə(x) may be 
taken as the positive and negative variations of f(x) itself 
(in the sense that pọ, (x) is the positive variation from 0 to 
æ when v is positive and minus the positive variation from 
x to 0 when a is negative, pẹ being similarly defined). The 
use of the origin as representative of an arbitrary point 2» 
in the convergence proof amounts to another change of 
notation, in connection with which (ð), in the course 
of the reasoning, takes the place of (a -+d)—gq, (a). 
Under the present hypotheses, p, and pə are everywhere 
continuous, and so uniformly continuous. Hence the choice 
of ð, if the proof is written out in terms of the general no- 
tation, can be made independently of zp. For any 2», further- 
more, 9 (% +) —9 (a) and gs(%-+7)—ge(%) can not 
exceed the total variation of f(x) over a period, a quantity 
likewise independent of z. These are the essential points 
needed to justify the supplementary assertion: 

COROLLARY I. If f(x) satisfies the hypotheses of Theorem Il, 
and is furthermore continuous everywhere, the Fourier series 
converges to f(x) uniformly for all values of x. 

If it is assumed merely that f (æ) is of limited variation for 
to—y <r <= Totty, and summable over a period, the 
function y (a) which is equal to f(x) for z—y < s£ < rott, 
and identically zero over the rest of a period, is of limited 
variation over the entire period, and its Fourier series converges 
at x» to the value 4[y(%+) + ¢(%—)] = 4[f @+)+f(%—). 
from which it follows that the Fourier series for f(x) does 
the same. If f(x) is continuous and of limited variation for 
“«—y <x < +y, and summable over a period, the 
function g(x) of period 2a which is equal. to f(x) for 
e—y <2<fB+y, and linear for 8+y<e¢<ae—y7+2n, 
is of limited variation and continuous over the entire period, 
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and its Fourier series converges uniformly with (x) for its 
sum; the Fourier series for /(a)— g(x) converges uniformly 
to zero for « <x < 8, by Lemma II; and consequently the 
series for f(x) converges uniformly to f(x) for «e < x < £. 
These conclusions may be expressed as 

CoroLLARY I. Jf f(x) is a function of period 2a, summable 
over a period, and of limited variation for x»—y < x < x+y, 
its Fourier series converges for æ = Tọ to the value 
Lfl t) +f lo); if f(x) is of period 27, summable 
over a period, and continuous and of limited variation for 
a—y < x< +y, its Fourier series converges uniformly to 
the value f(x) for a< x< B. 

It may be noted in passing that the second law of the 
mean leads to a proof of Lemma II for functions that are 
bounded and integrable in the sense of Riemann, without 
the use of Lebesgue integration. In the expression which 
was denoted by Jn(x), in the proof as given previously, let 
the integrals from —a to —¥y and from y to a be considered 
separately; the integrand is identically zero for —y < u < y. 
Since cot 4u is monotone from y to mæ, and cotia = 0, 


f fet cot su sinnudu = (cot £ a) f” fæ+sin nudu, 
Sy 2 á J7 


where § is a number of the interval (4, 7%). The magnitude 
of the last integral does not exceed the quantity 8¢,, which 
is independent of x and approaches zero as n becomes infinite. 
Similarly it may be shown that the integral from —a to —y 
approaches zero uniformly, and the conclusion of the lemma 
follows at once. 


3. Degree of convergence of Fourier series under 
hypotheses involving limited variation 
Attention will now be directed once more to questions of 
degree of convergence. The next theorem is a rather simple one: 
TueoreM II. If f(x) is a function of period 2 with 
limited variation, the total variation over a period being V, 
then, for n> 0, 
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wr r wet r 
pi S (x) cos nx dal < «äh J fæ)sinngzdz| < —. 
| J—n | m -7 n 

Let pı (x) and s(x) be the positive and negative variations 
of f(x), starting from the point —z, so that f (<) = f (— r) 
+9(2)— plx), pír) = 9(—a)=—0. Since f(a) —f(—a), 
pı (7) —go(a) = 0. But pı(n)+pə(a) = V, and therefore 
pılan) = pa (n) = 4V. By the second law of the mean, 


a ML 
ike pı (x) cosnada = n) J. cosnada 
Sal, y (Sines —smne) 
n 


Vsinn€& 


2H 7 


2 


xrcosnada, 5 š 
Similarly, 
| i A yV 
ə (7) GOS M sd < 
|J 92e) snada < on? 
while 


mt 
EE n)cosnrdx = 0. 


So the first inequality of the theorem is obtained. A mechan- 
ical repetition of the reasoning for the integral with sin nx 
would lead to the expression cosn§—cosna in place of 
sinna—sinn§&, and the new expression has the maximum 
value 2 instead of 1, since cosaa +0. But by virtue of 
the periodicity the integral from — a to æ is the same as that 
from —a+[mn/(2n)] to a+ [a/(2n)], and if the variations 
are measured from the left-hand end of the latter interval the 
upper bound V/n is obtained once more, as stated in the 
theorem. 

Suppose now that f(x) has a first derivative with limited 
variation, and let V be the total variation of /’(x#) over a 
period. By integration by parts, 


fre cos ng dr = aa sin ner I fire sinna dx 


n -r n 


= t ER i 
=-1 f'y (x) sin nz dz, 
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whence it follows further, since Theorem II is applicable to 
the last integral, that 

| ii, Ba) NE ues teh ee dale 

[Fe cos na da h Fy (x) sin nr da) < nè’ 
Similar reasoning applies to the integral with sin næ in 
place of cos næ. Repetition of the process of integration 
by parts leads to the following: 

COROLLARY I. Jf f(x) is a function of period 2m which 
has a pth derivative with limited variation, p = 0. and if V 
is the total variation of f® (x) over a period, then, for n> 0. 
fe f(x) cosna da ~ 3 i i fl sin nede < 4 ; 

-7 = prt t npr 

The conclusion is really somewhat more general than the 
statement would indicate. Suppose for simplicity once more 
that p = 1. As will be seen on re-examination of the 
proof in the light of well known theorems on Lebesgue inte- 
grals, the essential thing is not that f'(x) be uniquely de- 
fined at every point, but that f(s) be expressible as the in- 
tegral of a function g(x) of limited variation: 


fo) = F+ f vedz, 


for any value of a. This observation is of some interest, 
since the simplest functions represented by graphs with cor- 
ners satisfy the modified hypothesis, but not the original 
one. The generalization carries over to the applications of 
the Corollary in Theorems IV and Vb below. 

For the sake of another corollary, let f(x) be identically 
zero for zto— y S< x< xoy, and of limited variation over a 
period. If Sn» (7) is represented once more by the formula 


N ER vo 
Sn (xo) = if S (xo +u) cot g wsin nu du 
- -n 


ws 
aR ee 
ake om J (a +) cos nu du, 
“ —1 
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both f(v-+u) and f(%-+u) cot }u are of limited variation, 
regarded as functions of u, and consequently |S, (£o)! does 
not exceed a constant multiple of 1/n. With regard to 
uniformity of convergence, let f(x) be identically zero for 
@—y <r —8+y, the hypothesis of limited variation over 
a period being kept unchanged. When œ < 2 <, the 
function f(a +) cot 3u is identical with f(a + wu) C,(u), if 
C, (u) is defined as equal to 0 for O< |u|<, and equal 
to cot łu for 7<|u <a. In general, if ọ, (u) and ps (u) 
are any two functions of limited variation, the total variation 
of pı (u) being V, and that of gs (uw) being Vs, and if M, 
and M, are upper bounds for |  (w)| and | gs (wu) respectively, 
the function g, (u) pə(u) is of limited variation, and its total 
variation does not exceed M, V.+M.V,. In the present 
instance, p, (u) and gs(u) being replaced by f(a»-+-u) and 
C, (u) respectively, f(%-++u) has the same total variation 
over a period and the same bounds as f(x), for any value of 
zo, and C, (u) is independent of x), so that the total varia- 
tion of f(a +) cot 4u, while presumably different for dif- 
ferent values of zy, has an upper bound independent of zy, 
as long as zy») belongs to the interval («, 8). Such an upper 
bound can be calculated more specifically as the product 
of V, the total variation of f(x), by a quantity depending 
only on y. (The absolute value of f(x) can not exceed V 
anywhere, since every period contains points where f(x) 
vanishes.) It is possible therefore to state 

CoroLLaRy lI. Jf f(x) is a function of period 2m with 
limited variation, the total variation over a period being V, 
and if f(x) vanishes identically for «—y <a < B+, then 


Sor a< x<, where Sa (x) is the partial sum of the Fourier 
series for f(x), and Cy depends only on y. 

Each of these corollaries, taken in conjunction with results 
obtained earlier, leads at once to a theorem on degree of 
convergence. If » =>1 in the hypothesis of Corollary I, it 

4 
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is known that the Fourier series for f(x) converges uniformly 
to the value f(x); this follows from Theorem IX of Chapter I, 
and more directly from Corollary I or Corollary Ha of that 
theorem, since the hypothesis provides f(x) with a bounded 
first derivative, and 
|S (a) — fE) | < A |ts — a |, 

if 2 is an upper bound for |f” (x)|. On the other hand, Corol- 
lary I, interpreted in terms of the Fourier coefficients, 


states that 
lan| < V/(an?*), |ba] < V/(nnP*), 
Hence 


|F(@)—Sn (x) | 


a 


f > 0x cos ke a bk sin kx) | 


$ (Jaxl + lbr] 


k=n+1 


$ d 2V (* du 2V 
2, alr = n f wt pan? 


The conclusion may be formulated as 

THEOREM IV. Jf f(x) is a function of period ee which 
has a pth derivative with limited variation, p = 1, and 
if V is the total variation of f(x) over a period, then, for 
n>O0, 


lA 


IA 


Qp V 


St (x)— Sn (x) | < 


where S,(a) is the partial sum of the Fourier series for f(x), 
Qp is a constant depending only on p, and Q is an absolute 
constant: more specifically, Qp = 2/( prn), Q = Qi = 2/n. 

This result is to be compared with Theorem X and the 
corollaries of Theorem IX in Chapter I. 

Corollary I will be combined with Corollary I of Theorem IX 
in Chapter I, and with the theorem just obtained. In connection 
with the result from Chapter I, it is to be noted that if f(x) 
is a continuous function which is not identically constant, 
and if (0) is its modulus of continuity, 


o> 


lim inf —— 
Jd=0 
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For if this were not the case, it would be possible for every 
¢> 0 to find arbitrarily small positive values of ð such that 
(0) < €d; for any such 6, | f(a) —f(a)| S elxr:— az |, if 
vy — z, is an integral multiple of ô, since | f(a, + N6)—f (a2) | 
< Nw(d) <e. Nò, if N is integral; but for any choice of 
xv, and z, it would be possible to find values of ð satis- 
fying the condition that #(d) < «d, and so small that s — a, 
differs arbitrarily little from an integral multiple of 6; and 
then it must still be true by continuity that f(a.) — f(x)! 
< e|x —a,|, which in view of the arbitrariness of € means 
that f(%.) — f(z) = 0 for all x, and zs, contrary to the 
hypothesis that f(x) is not constant. The conclusion may be 
stated by saying that »(0)/d has a positive lower bound for 
values of ð that are sufficiently small; as (0) itself has 
a positive lower bound over any interyal not reaching to the 
origin, it appears further that @(0)/d has a positive lower 
bound over any finite range for Ò. 

Suppose now that f(x), having the period 27, is con- 
tinuous with modulus of continuity (0) for e—y < x < B+, 
but not constant over the interval, and of limited variation 
(but not necessarily continuous) over the rest of a period. 
Let g(x) be periodic with period 2a, equal to f(x) for 
e—y —x2<f-+7, and linear for 8+ y < z < e—y+2a. 
In the latter interval, the modulus of continuity of g(x) is 
a constant multiple of ð, which by the preceding remarks 
does not exceed a constant multiple of (0), say kw(d), over 
the range within which œ(ð) is defined, namely for 0< ð 
<f8—a«-+2y. If ð satisfies the latter condition, any inter- 
val of length ð is made up at worst of an interval congruent 
(modulo 27) to (8+4, «— n+ 27n), together with parts of 
two intervals congruent to (œe —, 8+ 4), and so g(x) has 
everywhere a modulus of continuity œ, (ð) which for 0< ð 
< B—«-+ 27 does not exceed (+2) (0). By the corollary 
cited from Chapter I, » (a) differs from the partial sum of its 
Fourier series by not more than Aw, (22/n) log n, and so 
by not more than A (k -+ 2) @(2a/n) log n, if n is large enough 
so that 27/n comes within the specified range of values for ð. 
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On the other hand, (2) — (x) is of limited variation, and 
identically zero for e—y < x < +n, so that the recent 
Corollary I is applicable; and the quantity 1/n which enters 
into the conclusion of that corollary does not exceed a constant 
multiple of œ(2 z/n). So it is possible to state 

THEOREM Va. If the function f(x), of period 2n, is con- 
tinuous with modulus of continuity w(0) for «—n < x < B+, 
where (6) >0 for d>0, and of limited variation (but not 
necessarily continuous) over the rest of a period, then 


S (x) — Sna (x)| < co(2a/n) logn 


for @ <a = B, ifn is large enough so that @(2n/n) has 
a meaning, Sy(x) being the partial sum of the Fourier series 
Sor f(x), and c a constant depending neither on x nor on n. 

The combination of Corollary II with Theorem IV (or 
rather with a part of that theorem) proceeds with a little 
more facility. Let f(x) be a function of period 2a having a 
first derivative with limited variation for e—y < x < B+y, 
while f(x) itself is of limited variation over a period. 
Let g(x) this time be of period 2a, equal to f(x) for 
e—y <x <+, and defined for 8+y=2<a—y+2n 
as a polynomial of the third degree so that » (x) and its 
derivative have determinate values at both ends of the interval. 
Then ọ (x) has a first derivative everywhere, which is of limited 
variation over a period, and f (2) — » (x) is of limited variation 
over a period and identically zero for e—y < s < B+y. 
It remains to apply Theorem IV to (a), and Corollary IT 
to f(@)— (a), and the following theorem is obtained: 

THEOREM Vb. Jf the function f(x), of period 2x, is of 
limited variation over a period, and has a first derivative of 
limited variation for «—y < x <= +n, then 


|f(@—Sy(x)| < + 
n 
for «@ < x < B, where Sa (x) is the partial sum of the Fourier 


series for f(x), und ¢ is a constant depending neither on 
nor on n, 
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Space will not be taken here for the working out of more 
elaborate combinations of similar character. A theorem re- 
lated to Theorem III, however, will be obtained for the sake 
of a subsequent application. 

It was shown early in the chapter that the Fourier co- 
efficients of an arbitrary summable function of period 2r 
approach zero as a limit. Let f(x) now be a function of 
period 27 which is absolutely continuous, and let ax, by be 
its Fourier coefficients. Its derivative exists almost every- 
where, and is summable. The product f(x) sin kæ is like- 
wise absolutely continuous, and so has a derivative almost 
everywhere, and its change of value over an interval is 
equal to the integral of its derivative. Specifically, it has 
a derivative equal to f'(x) sin kx +k f(x) cos kx at every 
point where f'(x) exists. Consequently 


0 = f(a) sinkn —f(—n) sin (—-k 7) 
or z 
— | P (x) sin ka dx -+ ef F(x) cos kæ dzs, 
eJ erR "E 


kak = a | St (x) sinka da. 


Ue 


= 


But by the theorem cited at the beginning of the paragraph, 
the last integral approaches zero as k becomes infinite, since 
f(x) is summable. Similarly, kb, approaches zero. 

It was seen at the beginning of the chapter that 


Mal ie 
= ig [ f(x) —Sn(x)]? da 


=1 | tremae—|$+ E atm): 


7t 


But under the hypothesis of the moment f(x), being abso- 
lutely continuous, is æ fortiori of limited variation, and hence 
Sa (æ) converges uniformly to f(x). So the left-hand member 
approaches zero, and the right-hand member must do the 
same: 


': T E 
y J {Uori = 3+ 3 city. 
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(It is still not necessary for the purpose in hand to establish 
the well-known fact that the last relation holds under much 
more general hypotheses. Use of the theorem on the con- 
vergence of the Fourier series for a function of limited variation 
can be avoided by reference to the least-square property of 
the sums S» (x), discussed in the next chapter, together with 
the fact that by Weierstrass’s theorem trigonometric sums 
Tn(x) can be found so as to make /(x)— 7,,(x) approach 
zero uniformly.) From the last two equations, taken together, 
it appears that 


zdne = B+ 


Let e be an arbitrary positive quantity. It is possible to 
choose no do that |kar| < (4e)"?, |kbr| < (E), for k > no. 
Then, if n Z no, 


Èros FS La u, 
k=n+1 


an Kk? 
which means that. 


limn > (a+b) = 
n=% k=n+1 


The results of the last two paragraphs may be summarized in 
THEOREM VI. Jf f(x) is an absolutely continuous function 
of period 2m, and if an, bn are its Fourier coefficients, 


lim LM n = 0; lim nb, = 0. 
n- n= 


If Sn(œ) is the partial sum of the Fourier series for f(x), 
and if yn is defined by the equation 


on 
m = S eso ae, 
then 
lim nyn = 0. 


n=% 


The first part of this theorem supplements Theorem III: 
the products nan, nbn are bounded for any function of limited 
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variation, and approach zero if the hypothesis of limited 
variation is replaced by the more stringent requirement of 
absolute continuity. The last part of the theorem will be 
used in the next chapter. 


4. Convergence of the first arithmetic mean 


The next undertaking will be to outline a theory of the 
convergence and degree of convergence of the approximating 
functions with which the name of Fejér is associated, the first 
arithmetic means of the Fourier series for a given function. 
The mean in question is defined by the identity 


onla) = Hoa) + Si) +--+ + Sra], 


where (a), as usual, denotes the partial sum of the Fourier 
series through terms of the Ath order. 

It will be recalled that the fundamental integral expression 
for S(x) involves a factor sin (k-+ 4) (t — x), and otherwise 
does not perk with k. The product of the expression 


sin — v + sin 5 —vp-+...+sin (n — z) v 


by 2sin $v can be rearranged in the form 


[1 — cos v] + [cos v — cos 2v]+ --- + [eos (n— 1) v— cos nv] 
= 1 — cosnv = 2sin? (nv/2), 


Application of this a with v replaced by ¢— z, gives 


sin Ftn(t — = 
o, 
n( nie =I ft t) 2sin® 4 (t — He 
or in terms of a new variable u = 4(¢ a 
1 sin? nu 
= ——— 21 o_o 
On (x) nae „fE i- sin? z qu; 


If (sin? w)/(sin*w) is denoted for brevity by ®, (u), the ex- 


pression becomes 


Sy (xz) = a oe Fle t+2w O, (u) du. 


nm Jri" 
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The fact that the trigonometric factor ®, (u) in the inte- 
grand, unlike the corresponding factor in the expression for 
Sn (x), is never negative, has important consequences, and 
materially simplifies the reasoning. If f(x) in particular is 
identically 1, each Sx (x) reduces to the single term 1, and 
6, (x) also is identically 1, for all values of n: 


1 iis sin® 4n(t— x) 1 ie 
~ nna J—z 2sin*}(t— x) = nn Jz? On (u) du. 


More generally, then, if f(x) has W as an upper bound for 
its absolute value, 

ats 
nm 
1 7/2 

-> MO, (u)du = M; 
na Jz 


— Fe} 


=n /2 
a S(a@ +2) Dn (w) du 


| On (x), < 


if |f@)| < M for all values of x, then \6,(x)| < M for all 
values of x likewise, and for all values of n. 

For any specified value of x, the identity in the preceding 
paragraph, multiplied by the quantity f(x), which is constant 
as far as the variable of integration is concerned, states that 


3 a: 1 az/2 
f@) = 55, Sro On (u) du, 
whence 
1 


on/2 
n (a) — fw) = —— J LF + 2u) — FE On (uw) du. 

Suppose now that f(x) is summable over a period, and 
continuous for # = a. Let € be an arbitrary positive 
quantity, and let ð < 7/2 be chosen so that |f (zo 2u) — f(x) 
< $e for |u| < ô. Then 


? | a 
= f eat — f(xo)] Patu) du) < | 1 eodd 
| = J-F? 


n J 
1 *n/2 1 i 
= na Jan g E On) du < 9 * 


for all values of n. On the other hand, since Øn (u) < 1/(sin®¢) 
for ô < |u| < n/2, 
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aa E + ' [f(a + 2) — f(a)] Dn (u) du | 


—2/2 
1 z2 


S |f (wo+2u) — f'(&o)| du; 


n r sin” Ò J—rz/2 


IA 


the last integral exists, by the hypothesis of summability, 
and is independent of n, so that the whole expression is less 
than $e as soon as n is sufficiently large. Therefore 
On (xo) — f(z) |<, when n is sufficiently large; on (æo) 
converges toward f(x). If f(x) is continuous everywhere, 
on (x) converges toward f(e) for all values of x, and the 
convergence is uniform. For 6 in the proof can be chosen 
independently of 2), since f(x) is uniformly continuous, and 
if M is the maximum of /(z) , 


*n/2 
| f(a +2u)—f (ao) |\du <S 2Ma, 
—n/2 
which is likewise independent of zo. 

For convergence at a point of discontinuity, let f(x) be 
summable over a period, and have a finite jump for x = xo. 
Let f(z») be defined as the mean of the limits approached 
from the right and from the left. Let a function W(x) be 
defined by the requirements that it shall be of period 27, 
and linear for a <x < xo +27, and that W(a)+) = f (to +), 
Y (zo—) = f(@o—), W(x) = f(z). For x = a, each 
partial sum of the Fourier series for w(x) is exactly equal 
to W(x), as was pointed out in substance at the beginning 
of the proof of Theorem Il. The same is true therefore of 
the arithmetic means of these partial sums. But f(x) is the 
sum of y(x) and a function continuous for «= a»; and the 
arithmetic mean formed for the sum of two functions is the 
sum of the corresponding arithmetic means. So the arith- 
metic mean on (x) formed for f(a) converges for x =— xo to 
the value /(2»). 

The results on convergence obtained thus far may be 
restated in 

THEOREM VII. Jf f(x) is a summable function of period 27. 
and dn (x) the arithmetic mean of the first n partial sums of 
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its Fourier series, as defined above, (x) converges toward 
f(x) at every point where f(x) is continuous, and converges 
toward the average of the limits approached from the right and 
from the left at every point where f(x) has a finite jump. 
If f(x) is continuous everywhere, the convergence is uniform 
throughout. 

It should be mentioned as an immediate consequence of the 
definition of the arithmetic mean that if Sn (x) converges at any 
point, 6, (x) converges to the same value. In fact, if Sp, Sı, S2, 
is any convergent sequence of numbers whatever, with limit S, 
and if on = (So S1 -+ --- +Sy—1)/n, then on also converges 
to the limit ©. Let S— Sy = Ay, and let N be chosen so 
that | Rk|<4e for k = N. Then, if n> JN, 


n S—(So+ Si + mite +Sn-1)_ 


S— Gn — 
n 
_ Pot hit i HR 
n 
= Ry + aba ta + Rya T Ry+ ES + Ra- | 
n n : 


the last fraction is less than ġe in absolute value for all 
values of n> N, and the preceding fraction, in which the 
numerator is independent of n, is less than $e for n suffici- 
ently large. 


5. Degree of convergence of the first arithmetic mean 


As a first hypothesis for the study of degree of convergence, 
let. f (x) satisfy everywhere the condition 


F(a) —f(a,)| < i| t — az |. 

In the integral expression for on(x)— J(=), 
S(a@+2u—f(@)| < 24|2u], 

so that | 

| Oy (x) — F (a) 2s fe D,(u)du = A uD) du. 
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In dealing with the last expression, it is to be remembered 
that u/sinu < a/2 for 0<u < 7/2. Let the integrals from 
O to 1/n and from 1/n to 2/2 be considered separately. 
In the former, let the integrand be expressed by the formula 
u ; sin nu 
uD,(u) = ETR sinnu- u > 


of the three factors indicated, all of which are positive for 
O<u < 1/n, the first does not exceed 7/2, the second does 
not exceed 1, and the third does not exceed n. Hence 


an _ 7h 
if uD, (u)du = 9° 
In the integral from 1/n to 4/2, 
u? 1 nm 1 
D == ——.—-.. sin’ i < . 
u D, (u) aa Se 


and 
f2 a ("du m? a 
D — = - mj. 
i uM, (wdu S| ‘i 7 n (log 5 + log) 


It follows that the whole integral from 0 to 2/2 does not 
exceed a constant multiple of logn, and | o,(”)—/f(x)| does 
not exceed a constant multiple of (log n)/n. A slightly 
more specific statement of the result is 

THEOREM VII. Jf f(x) is a function of period 27 satis- 
Sying everywhere the condition 


|J (x2) —f'(m)| S 4 |t — z |, 


A 


and 6,(x) the corresponding arithmetic mean, then, for n>1, 


f(a) — a, (x)| < CoA log n 


n 


where Co is an absolute constant. 
Let f(x) have a derivative satisfying everywhere the 
condition 
\f' @)—S' a) S |r — a l, 
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with a positive value of «. The error then can be somewhat 
more narrowly restricted. The factor ®,(u) being an even 
function of u, the substitution of — u for u gives the integral 
expression for the error the alternative form 


oy (0) — f(a) = — -> J _ [fe 20) —F@)) Onl au 
7/2 
g> = J 31/2 [f — 2u) —f(x)] Dn (u) du, 


and a third expression is obtained by taking the average oj 
this and the original one: 


27/2 


mafo =z Í [F422 Afaa] O, (wdu 
e e ni2 


By the mean value theorem, together with the hypothesis 
now in force, 

Sæt 2u — fa) = 2uf (8), 

fæ—2u)— fe) = — 2 uf (E), 
S(et+2u—2f(@+f(e@—2u) = 2u [f (&) — 7’ E), 

S(@+2u—2f@)+f@—2u)| < 24\u|-|§— 8] 
< gre+1 7 | u ati 

the numbers §, and §, being in the intervals (x, «+ 2u) and 
(x — 2 u, x) respectively, so that | &,—&| < 4|a!. Hence 


924 2 7/2 : 
om Ge) =f) Ss — Í u H Dy (u) du 


NT J-r/2 


eet yg (7P 
= 2 f u“ D, (u) du. 
nm Jo 
But 
1 ihe 
<=> Ss 
D, (u) = sin® u = 4 u? 


in the interval of integration, so that 
Qh (7° du 


an (x) — f (x)| S Fa mail ber =e 
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which does not exceed a constant multiple of 1/n, the last 
integral being convergent. In more formal statement: 

THEOREM IX. Jf f(a) is a function of period 2m having 
a first derivative which satisfies everywhere the condition 


k (Tə) —f' (ay) < 2 | Ta — Tı |E 


with «>00, and if 6) (x) is the corresponding arithmetic mean, 
then oe 
. ae 
Í (x) — On (x) a) 
where Ce is a constant depending only on «. 

No higher order of approximation would be obtained by 
supposing f (x) provided with additional derivatives. In fact, 
the arithmetic mean corresponding to the analytic function 
f(x) = cos x is or (x) = [(n—1)/n] cos a, and the error 
is actually of the order of 1/n. 

Let f(x) be an arbitrary continuous function of period 2 2, 
with modulus of continuity œ (8). The theorem to be obtained 
in this connection is perhaps of secondary interest, because 
a closer result is given for an important class of cases by 
the theorem following it; but it also covers cases not admitted 
by the hypothesis of the later theorem, and so is not entirely 
superfluous. The proof is an adaptation of that of Theorem IL 
in Chapter I. Let p(x) be a continuous function of period 2 7r 
which is equal to f(s) for a set of values of æ dividing 
a period into n equal parts, and is linear between successive 
points of this set. This y (x) satisfies the hypothesis of Theo- 
rem VIII, with 4 = [w (2 /n)]/(2 z/n), and is represented 
by the corresponding arithmetic mean with an error not 
exceeding a constant multiple of (2 z/n) log n. The absolute 
value of the mean corresponding to the difference f (2) — 9 (x) 
does not exceed the maximum of the absolute value of the 
difference itself, which is not greater than 2@(2z/n), and the 
error of this mean can not be greater than 4w(27/n). Hence: 

THEOREM X. If f(x) is a continuous function of period 2 m, 
with modulus of continuity o(d), and an (x) the corresponding 
arithmetic mean, then, for n~1, 
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- sA 2n 
| F(x) — on (a) | S Co o |- 7) log n, 
where Cy is an absolute constant. 


Let f(x) be subjected to the hypothesis that 


|f (£x) — f (æ) < Alt — zi he 


with O<@<1. From the original integral expression for 
the error, 


$ 
la (x) — f(x)| < — 2u DO, (udu 
n (x) — f( WS on Ea “© Dy (uj) di 
Qe+1 7) n/2 
= ~——— u“ Dn (uw du. 
nm 0 


Let the interval of integration again be considered in two 
parts, from 0 to 1/n and from 1/n to 2/2. In the former 
interval, 


u“ ` sin nu 7“ m\* > 
u On (u) = ——— sin“ nu- (Sn) < [5] -1-n?-* 
n (u) sin“ v sin u =A : 


so that 
ljn a / 7 \a 
ik u“ D, (uy) du < | =} m=, 


From 1/n to 2/2, 


u“ Cae S| 
u“ On (u) < — hike erie 
n(u) S sin?tu = 4 y?-?? 
E u“ On (u) du < R o. e a Me 
In < = 4 Jin 4 Jimu 4(1—a)° 


Substitution of these inequalities in the formula for the error 
gives 
THEOREM XI. Jf f(a) is a function of period 27 satisfying 
everywhere the condition 
| f(a) —J(a)| < A ža — 2 |"; 

with O<a@ <1, and if 6, (x) is the corresponding arithmetic 
mean, then 

| | Cr 2 

IFE) — ala) S Tes 


Y 


where Ce is a constant depending only on «a. 
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Two outstanding facts with regard to the arithmetic mean, 
as compared with the simple partial sum of the Fourier series, 
are that the former converges for every continuous function, 
but does not reproduce identically a function which is itself 
a finite trigonometric sum. Consideration of Theorems XI, 
VII, and IX, in the order named, throws some further light 
on the ability of the arithmetic mean to adapt itself to irre- 
cularities in the function represented, and its inability to avail 
itself of an exceptional degree of regularity. The hypotheses 
of the three theorems imply that 


Fæ) —2flx) + fle —v)| < 2 fel 


with a constant 4, in each case, and values of « successively 
less than 1, equal to 1, and greater than 1 (the present «, 
in the case of Theorem IX, taking the place of the number 
previously denoted by « +1). The upper bounds obtained 
for the error of the arithmetic mean have the orders respect- 
ively of 1/n®, (log n)/n®, and n®yn*. The corresponding 
upper bound for the error of the simple partial sum of the 
Fourier series is of the order of (logn)/n“ in each case. 
For a concluding theorem with regard to the arithmetic 

mean, let f(x) be of period 27, summable over a period, and 
identically zero for « — n <s <-+y. If x has any value 
in («, 8), f(æ+ 2u) is identically zero tor —}y < u < hy. 
Hence 

1 re —H/2 7/2 i 

Gila) = — (| +f ) fæti) O, (u) du. 
NTE 7/2 


z|2 


Since Øp (u) < 1/(sin? $4) for 44 <lu! < 4a, it follows that 
2 1 ( ye ft) aM: 
Gy (s)| < naan hy Piei le S(a+2u)\ du 


1 ar]? 
Í a \f(@+2u)| du. 


= na sin? hy J—z/2 


In terms of the variable t = «+ 2u, 


m/2 1 E a 
f etzon = 3 f Olar 


5 
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the limits of integration on the right having been adjusted 
by use of the frequently applied observation that the integral 
of a periodic function over a period is independent of the 
position of the initial point of the period. If the last integral, 
which is independent of x, is denoted by J, 


J 

\Gn(x)| < 2na sin? hy" 
The conclusion is 

THEOREM XI. Jf f(x) is a function of period 2m which 
is identically zero for a — q <x < R+, and summable over 
a period, the integral of its absolute value over a period being J, 
and if 6,(x) is the corresponding arithmetic mean, then, for 
a<r Ê, 


| Ki 
| On (x), < ae 


where C, is a constant depending only on 4. 

Taken with the last assertion in Theorem VII, this shows 
that if f(x) is continuous for «— y < x < &+-y, and summable 
over a period, the arithmetic mean associated with it con- 
verges uniformly for « < x < 8. The results obtained by 
combining Theorem XII with Theorems VIJI-XI need not be 
enumerated at length. 


6. Convergence of Legendre series under hypothesis 
- of continuity over a part of the interval 


The theory set forth in the early part of the chapter can 
be carried over in some measure to the case of Legendre 
series, the discussion being kept on the same elementary level 
which was maintained in the treatment of these series in 
Chapter I, to the extent that no use is made of an asymptotic 
formula for the Legendre polynomials. The next paragraphs 
will be devoted to a presentation of the analogies that work 
out most readily, though it will be seen that the treatment 
is left incomplete in several particulars. 

Let f(x) be summable together with its square for 
—1<2<=<1, and let ax be the coefficient of X,(x) in its 
Legendre series, 
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2k 


EL f JORO at. 


ük = 


Let 
Sn (x) = to Xo (2) + a Xi (1) + +++ + dn Xn (2), 


as before. By reference to the property of orthogonality of 
the polynomials X(x), together with the definition of a; and 
the fact that the integral of [X;.(x)]*? over the interval (—1, 1) 
is 2/(2k-+-1), it is seen that 

£ d ‘ : lg s = 2 3 
Ji [7 (x) — Sn (x)] dz = YE [F dz — Qk+1 lk: 
As the quantity on the left-hand side can not be negative, 
the sum on the right is bounded for all values of n, the 
infinite series obtained by letting n become infinite is con- 
vergent, and 


k=0 


lim ax/(2k-+-1)'* = 0, 
k=% 


or, as equivalent statements, 


3 
lim ake — 0, Tim ee f’ FO XOdt = 0. 
k=% k=% J—1 


The parallelism with the case of Fourier series becomes 
clearer if the polynomials X;(a) are replaced by the normalized 
sequence {[(2k + 1)/2]!? X;(x)}, or, more superficially, if it 
is considered that the magnitude of the coefficient is less 
significant than the magnitude of the general term of the 
series: it was seen in Chapter I that | X;.(%)| does not exceed 
a constant divided by k2, when æ stays away from the ends 
of the interval, and consequently 


Jim Ak Xx (x) ==) 3, 


uniformly throughout any closed interval interior to (—1, 1). 

Let f(x) again be summable together with its square, and 
suppose now that it vanishes identically for a)—y< x < rot, 
the interval (%—y7, 2+ 7) being contained in (—1, 1). The 
value of Sn(£o) can be expressed in the form 


5° 
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1 ~ 
Sa (to) = Spr te J ia Xn (t) dt 


| te 
ey (x) 


)dt. 


Each of the quantities Xn+1 (%0). Xn (xo) is less than a con- 
stant divided by n°, and the factor in front of each integral 
is therefore less than a constant multiple of n°; the quan- 
tity 1/(xə— t) is bounded over the range where /(f) is 
different from zero, and f(t)/(xọ— £ therefore is summable 
together with its square; consequently, by the preceding 
paragraph, the product of either integral by a quantity of 
the order n™? approaches zero. In other words, S, (7) con- 
verges to the value 0. If two functions f(x), g(x) are identical 
for so—y < x < roty, the Legendre series for their differ- 
ence cony erges ‘toward zero at the point 2», and the series 
for f(x) and p(x) themselves converge or diverge together 
at that point: the convergence of the Legendre series for a given 
function at an interior point of the interval (—1, 1) depends 
only on the behavior of the function in the neighborhood of 
the point. 

As‘a preliminary to a discussion of uniform convergence, 
let f(~) be any summable function with summable square 
over the interval (—1, 1), and let 


“y 
Arly) = E FO X(t dt. 
It is to be shown that /'* Ax(y) approaches zero uniformly 
for —1<y<1. The proof is largely a repetition of the 
corresponding argument for the case of Fourier series, but 


there are differences of detail which are not altogether trivial. 
For any fixed value of y, Ax(y) is the same as 


ile fi (t) X(t) dt, 


if A(Ð is a function equal to f(t) for —1 < t< y, and 


a for y<¢<1, and consequently k? Ax(y) approaches 


www.rcin.org.pl 


II. LIMITED VARIATION; ARITHMETIC MEANS 69 


zero tor the particular value of y in question. Let e be any 
positive quantity. Let 


Ay) = [ror at. 


This function is continuous, and therefore uniformly continuous, 
for —1<y<1. Let ð be chosen so that 


1 2 
Aly) —AW)| < (5 4) 


whenever | #/2— y, <0, yı and ys being points of the inter- 
val (— 1, 1). Let N be the smallest integer such that 2/N< ð, 
and let z; = —1-+(27/N). j =0,1,---, N: The points z 
being finite in number, there is a k’ such that 7° | Ax (z) < $e 
for all the values of j in question, whenever k > K. For 
any value of y in (—1, 1), let j be that one of the numbers 
0,---, N for which 0 < y—2z<2/N<4, and let Z = zj. 
Consider the difference 


y 
Arly) — A(Z) = Í f(t) Xet) dt. 


By Schwarz’s inequality, 
fr xe atl) < [iron ae f Moat 
= (Ay) — AZ) f Exe o} at 
< MAZI S decor at 


[ 
2 eee 1 |’ 
£ (5 . 2/(2k+1)< (5 e) k. 
Consequently 


K| Ay (y) — Ar (Z) | <S Ə €; 


This is true for any positive value of k. On the other hand, 
if k> k, k| A(Z) < 4e. For such values of k, therefore, 
12| Ak(y)| <e; and i’ is independent of y. Let « be the 
maximum of | Ax(y)| for —1 <y <1. The conclusion is that 
limz=oo. k 4. = 0. 


aly 
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Now let f(x) be summable together with its square over the 
interval (—1, 1), and identically zero for e—y < x < B+y, 
where —1<a@—y<f+y7<1. Consider the expression 


f 20580 a fi HO a, 


Lo 


on the assumption that x has a value belonging to the inter- 
val æ < x < 8; the function A,(¢) has almost everywhere 
a derivative equal to f(A X,(é). Let R(w) be a function 
equal to 1/u for |u| =y, and so defined for |u| <y as to 
have a continuous derivative everywhere. Then, for any 2 
in (@, 8), R(x— t) is the same as 1/(æx— t) at all points of 
the interval — 1 < ¿< 1 where /(#) is different from zero, 
and the integral above is equal to 


K An(t) Ræ — t) at. 


The product A, (t) R(@— t) has almost everywhere a derivative 
with regard to ¢, equal to Aj,(t) R(a—t) — A, (t) R' (x— t), 
and its increment over an interval is equal to the integral 
of this derivative, so that 


ft A}, (1) R(a — t) dt 
= [M0 Re— OP, + S An) Reh at. 


Both R(w) and R’(w) are continuous for all real values of «u, 
and approach zero as u becomes infinite in either direction. 
Let M, be the maximum of |R(w)|, and M, the maximum of 
| R'(u)|; these numbers are of course independent of x. In 
the right-hand member of the last equality, A,(—1) = 0, 
|A4n(1)| < Ens | R(a@—1)| < M, while | An (| < &, 
| R'(£ — t)| < < M,, over the whole interval of integration. 
Consequently 


1 | 
le Ah () Rz — t) dt, < Mo En + 2M, En. 


Since lim,—» n! En = 0, 
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nin fi LO%O g 
— xs—t 


approaches zero uniformly for « < x < 8. The same con- 
clusion holds if X» (t) is replaced by Xn+ı (£). On the other 
hand, »/? Xa+ (x) and n? Xn (x) are uniformly bounded over 
(«, 8). Hence, on assembling the constituent parts of the 
integral expression for Sa (x), it is seen that Sn (x) approaches 
zero uniformly over the interval in question: 

THEOREM XII. Jf f(a) is summable together with its square 
over the interval (—1,1), and identically zero over an interval 
“—y < x < +n contained in (—1, 1), and if Sn (x) is 
the partial sum of the Legendre series for f(x), then Sy (x) 
converges uniformly toward 0 for a <x < B. 

With the corollary of Theorem XI in Chapter I, this yields 
at once the further 

COROLLARY. If f(x) is summable together with its square over 
the interval (—1, 1), and continuous over the interval «—y 
<x <= B+ y, with a modulus of continuity @(0) such that 
limy—, © (ô) log ô = 0, the Legendre series for f(x) converges 
uniformly to the value f(x) for «< « <B. 

7. Degree of convergence of Legendre series under 
hypotheses involving limited variation 


It remains to consider questions of degree of convergence. 
The discussion will be based on the properties of the Legendre 
polynomials already used, together with the identity 

1 
2n+1 

Let f(x) be a function of limited variation for — 1 S 2 <1, 
its total variation being V. Let ọ,(x) and g(a) denote its 
positive and negative variations, measured from the left-hand 
end of the interval, so that f(a) = f(—1)+9,(@)—@.(a), 
pı (—1) = pa(—1) = 0. By the second law of the mean, 


Ji 9:@) Xn @) de = (0) f Xn @) ae 


Xn (x) = [Xr (x) — Xr O]. 
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for some value of § in (—1, 1). But as a consequence of 
the identity in the preceding paragraph. with the observation 
that Xati (1) = 3 PE (1) = iV 


ae 1 ` ; 
J. Xn (x) da = TES [Xn+1 (£) — Xn (a)l; 
CCIE? 
ER. 
For a sufficiently close estimate of the magnitude of the 


difference in the last bracket, there is occasion to go back 
to the identity 


Xa (a) = = p [e+ i a)? cos g dy. 


7 


Xn- ($) — Xn ©]. 


If this formula isused to express the difference Xn- (£) —Ny41(7) 
by means of a single integral, the integrand is 


[z 4i x)? cosg] {1— [x+ i(1—22)” cos g]?} 
= [z+ a) cosp 41 —a?)*[(1 —27 "(0 + cos*y)—2 ix cosy]. 


For x in (—1, 1), the absolute value of the last bracket 
can not exceed 4, and hence 
7T 


|Xn—-1(7) AEEA ta — 22/2 f; leti — a cosy-idy. 


On repeating, with slight changes of notation, the reasoning 
which led to inequalities for the Legendre polynomials in 
Chapter I, it is found that 


j2 
fetia- eos "ldo = 2f [2*+-(1—2*)cos*g]"-)2 de, 
z®-+(1—2*) cos? y < e -Pp 


for the values of the variables that come into consideration, 
if z = (2/7) (1— x°}, and 


71/2 f n 
a9 1/2 
f e—(n—1)2*p"/2 dy < 2 ew du 
0 = (n—1)* 2 Jo 


a Sea m F f ea 
Aeee eer a oe 
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With the observation that n—1 => $n for n = 2, the pre- 
ceding inequalities may be summarized by saying that there 
is an absolute constant ¢ such that 


Xna (a) — Xa4a(z)| < c/n? 


for n>1 and for —1—a<1; it is clear that the case 
n=1 can be included in the final statement. (Strictly 
speaking, of course, the result is obtained directly for |x| <1, 
and extended to the ends of the interval by continuity.) 
The essential point is that there is no factor (1—.2*)"? in 
the denominator on the right, as there was in the corres- 
ponding inequality for a single Legendre polynomial; its 
absence is due to the factor (1—2*)"? which comes out before 
the integral sign in the expression for the difference. 
Applied to the problem in hand, the inequality just obtained 
shows that 
fno nede < SO < $e, 


= 9n+1 ni? 2n”! 


Similarly, the magnitude of the integral formed with pə(2) 
in place of g(x) does not exceed co(1)/(2n*°?), while for 
n >1 the integral corresponding to the constant /(—1) is 
zero. Consequently, as p: (0)+9:(1) =V, 

Tig 5 | cV 
f f@ Xde < or 
The analogue of Theorem III is 

THEOREM XIV. Jf f(x) is of limited variation for —1 < x 
<1, its total variation being V, and if an is the coefficient 
of Xu (æ) in its Legendre series, then, for n>, 
Ry V 


nie ? 


| dn | < 


where Ry ws an absolute constant, so that 


: _ RV 
an Xn (x)| < — 
- E 
for —1 +y <s 1—y%, the constant R, depending only 
on y. 
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Now let it be assumed that f(x) has a first derivative with 
limited variation, and let the total variation of /’ (x) for 
—1<2<1 be denoted by V. (In more general terms, it 
would be sufficient that f(z) be the integral of a function 
of limited variation, and not necessarily provided with a unique 
derivative at every point.) For the moment, let 


Yaa) = f° Xa at. 
By integration by parts, 
f fa) Xn(a) dz = ti Sa) Vala) dz 

= Fæ) Ya@P,— f SE Yala) de. 
In the last member, Y,(—1) = 0, and for n > 1, Ya (1) = 0 


also. Furthermore, by the identity already employed, 


1 
2n+ 1 
the terms coming from the lower limit of integration cancel 
each other, since Xn (— 1) = X,4(—1) = (—1)""".. Hence, 
with the assumption that n > 1, 


Yn (x) = [Xn (x) =e (æ); 


1 1 
ite SO) Xue) dx = 5 -< 4 fre [Xn (2) Arnal dev. 


If the integral on the right is taken as the difference of two 
integrals, Theorem XIV, or, more directly, the inequality 
immediately preceding the statement of that theorem, can be 
applied to each. It is to be noted once more that 1/(n— 1) 
=< 2/nforn>1. The resulting general formulation, however, 
does not hold for n = 1, as may be seen by taking f(x) = Ga, 
in which case V = 0, while G may be arbitrarily large; the 
conclusion is essentially restricted to values of n > 2. In- 
duction based on repetition of the process of integration by 
parts leads to a conclusion which may be stated as 

CoroLiary I. If f(x) has a pth derivative of limited variation, 
p= 9%, and if V is the total variation of f (a) for —1 <r <1, 
then, for n>p, 
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LARS 
where Rp depends only on p, and 
Em 
ne 


|an Xp (x) | < 


Jo —1+y524<1—y7, where Ry, depends on p and on 4, 
but not on x or on n. 

Suppose f(x) is of limited variation for —1 < x < 1, with 
total variation V, and identically zero for e—y < x < B+ y. 
Let R(u), as before, be equal to 1/u for |u| >, and defined 
in some way for |u, <y so as to have a continuous derivative 
everywhere. If M, is the maximum of |R'(u)|, the total 
variation of R(u) over any interval of length 2 does not 
exceed 2M,. The quotient /(4)/(@—#) is identical with 
J R(x— t), if x has any value belonging to the interval 
«< x<. It is of limited variation, regarded as a function 
of ¢, and its total variation has an upper bound which may 
be expressed as the product of V by a quantity depending only 
on y, and, in particular, independent of x. So Theorem XIV 
may be applied to the function /(/) R(x— t), and the result 
substituted in the integral expression for S,(a), to prove 

COROLLARY IH. Zf f(x) is of limited variation for —1 <2=<1, 
with total variation V, and identically zero throughout an 
interval «@—y <x < B+ contained in (—1, 1), and if 
Sn (x) is the partial sum of its Legendre series, then, for 
n>1 ander, 


r 
Y , 


Cy 
x) | < —— 
Sn (a) | < ae 


where G depends only on y. 

Further consequences can be deduced substantially as in 
the case of Fourier series. If f(z) has a bounded derivative 
for —l<a2< 1, it comes under the hypothesis of the 
Corollary of Theorem XI in Chapter I, and its Legendre series 
converges to the value f(x) for —l<a2<1. If the series 
converges uniformly for —1 < x < 1, its sum agrees in value 
with f(x) at the ends of the interval also, by continuity. 
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So Corollary I above yields information as to the order of 
magnitude of the error, under the hypotheses indicated. A result 
may be stated for the entire interval (—1, 1), as well as for 
an interior interval, since | a, Xn (x)| < an| for —1 <z <1: 

THEOREM XV. Jf f(x) has a pth derivative with limited 
variation, p=1, if V is the total variation of f(x) for 
—1l<a2<=1, and if S, (a) is the partial sum of the Legendre 
series for f(x), then, for n >p, 


4 v a Q; y 
S@)— Sa (x) < PT 


for —1 <x <1, where Qp depends only on p, and 


nos ae 
S(2)— Sq (@)| < 


for —1+y <x < 1—y7, where Qr depends only on p and on y. 

In conclusion, the following consequences of Corollary I, 
taken first with the corollary of Theorem XI in Chapter I 
and then with the Theorem XV just formulated, may be 
stated without further comment: 

THEOREM XVIa. Jf f(x) is continuous with modulus of 
continuity (0) throughout an interval œ — q < x < B+y 
contained in (—1, 1), where w(d)>0 for d>0, and of 
limited variation over the rest of the interval (—1, 1), then 


(F) — Sn (x) | < cw(2/n) log n 


Jora < x < R, ifn is large enough so that o(2/n) has a meaning, 
Sn (x) being the partial sum of the Legendre series for f(x), 
and c a constant depending neither on x nor on n. 

THEOREM XVIb. Jf f(x) is of limited variation for —1 
<a2<—1, and has a first derivative of limited variation 
throughout an interval a—nņn < r< B+y contained in 
(—1, 1). then 

sa-m 


n 


IA 


Jor @= x= B, where Su (x) is the partial sum of the Legendre 
series for f(x), and c is a constant depending neither on x nor 
on n. 
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CHAPTER III 


THE PRINCIPLE OF LEAST SQUARES 
AND ITS GENERALIZATIONS 


1. Convergence of trigonometric approximation as 
related to integral of square of error 

In the discussion of Fourier series hitherto, scarcely any 
mention has been made of one of their most remarkable and 
important properties, namely their relation to a problem of 
least squares. The property in question is not peculiar to 
Fourier series, but is of much wider significance. 

Let po(x). pi (x), ps(w),--+, be a sequence of normalized 
orthogonal functions over an interval (a, b), that is, a sequence 
of functions satisfying the conditions 


) dh 
fv Hm edz =0 (+h), [i [prae dr = 1, 


Let f(x) be another function defined over the same interval. 
The functions px (7) and f(x) may be continuous, or, more 
generally, bounded and integrable in the sense of Riemann, 
or merely summable together with their squares. The problem 
of least squares in question is that of determining a set of 
coefficients co, +--+, Cn, for a given value of n, so that when 


g(x) = copos) + pr (@) + +++ + en Pa (2) 
the integral “ 
J; [F — 9(@)P dz, 
regarded as a function of its coefficients, shall be a minimum. 
The following theorem is well known: 


THEOREM I, The integral has a minimum value, for the 
attainment of which it is necessary and sufficient that 


b 
Ck = [rome daz, k = 05.1, ++, 2. 
77 
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It is to be noticed at the outset that no function of the 
form (x) can vanish identically over the interval unless all 
the coefficients cx are zero; in other words, the conditions 
on the p’s insure that they are linearly independent. For 


b 
J bePir= g++ +e, 


and the identical vanishing of » would require the vanishing 
of the sum of the squares. More definitely, it is recognized 
that the value of the integral on the left must be positive, 
if the c’s are not all zero. If the p’s are continuous, this 
means that there must be one or more intervals throughout 
which gy remains different from zero; if they are not so 
restricted, it means in any case that » must be different from 
zero Over a set of positive measure. 

To prove the necessity of the condition, suppose that ọ (x) 
is a function of the form X cx pr (x), in which, for a particular 
ndex k = m, 


a 
Cw + i SF (@) pm (x) dx. 
Let 
YW (a) = co polt) + ey pa (@) + +++ + (Om + A) pmax) + +++ + en pn (a) 
= g(x) +hpm (2), 
where % is a constant, the value of which will be specified 
presently. Then f(x) — w(x) = f(x) — 9 (x) —hpm (£), and 


fue — yp (a) dz =f Lf(z) — 9 ada 
2 ab 
—2n f [f(x) — 9 (2)] pm (x) den f, [pm (@)] dz. 


The last integral in this equation is equal to 1; if the others 
are denoted by J, R, S respectively, 


J = R—2h8+ hk. 
But f9@pm (a) da = cm, and 


= f Ves 1 pm (2) da = f A(@)pm (a) de —en + 0, 
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by hypothesis. Let k be taken equal to S; then 
J = R—S*<R, 


and the value R corresponding to the function ọ is not the 
smallest value of the integral, since the function w gives 
a smaller value. 

To show that the condition is sufficient, suppose now that 


b 
p = X cr pk with ck = fre) px(x) dx for k = 0,1,---, n, 
and let 


yle) = opl tH ama +--+ +h Parla), 


where at least one cj is different from the corresponding cr. 
Then o (x) — W(x) is a linear combination of po, ---, Pn, with 
coefficients which are not all zero, and 


b 
A f ly (x) — y (a) dx > 0. 
On the other hand, 
ab 
$ Jæ) pi (x) dx = oe = fe g (x) pr(x) dx, 
SUO yama de =0, k=0,1, ++, 


so that if x(x) = X c pe(x) is any linear combination of 
Pos ***s Pn, 
MORTO x@ ax 


= Daf ro — g (x)] p(x) dx = 0. 
In particular, 


U = f Fave lye) pe] az = 0. 
Hence, by the resolution f— w = (f — p) +(y — Y), 


MOO dx = p Lf) — y (a) dr +2U+V 
> f'ase dz, 
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which means that ọ actually gives the integral a smaller 
value than any other linear combination of po. +-+, Pn- 

As the most important of specific cases. po, Pi; Pz, -++ may 
be the sequence of functions 


1 cosg sine cos 2x sin 27 
(2n)? a? mle ? al ? mile 


If n above is replaced by n’ = 2n-+- 1, ọ (x) is a trigonometric 
sum Ta(x) of the mth order. A periodic function f(x) being 
given, the condition that the integral 


Í ki X U) — Tala)? dz 


shall be a minimum, is that the coefficient of cos ka/m'? be 


az =F 
Jiro cos A dt 


m! 


for k>0, with a corresponding determination of the constant 
term and the sine terms, and this is equivalent to saying 
that Ta(£) shall be the partial sum of the Fourier series 
for f(x). 

This property of least squares may be taken as a point of 
departure for a discussion of the convergence of the series, 
under suitable hypotheses with regard to f(x). While the 
hypotheses are more restrictive than those used in the con- 
vergence proofs already given, the method is of interest for 
its own sake, and furthermore lends itself to a remarkable 
variety of generalizations in other directions. It depends on 
theorems of approximation from Chapters I and II, and also 
on a proposition, due in the first instance to S. Bernstein, 
the simplicity and importance of which are noteworthy in 
the highest degree: 

BERNSTEIN’S THEOREM. Jf Ta(x) is a trigonometric sum of 
the nth order, and if L is the maximum of its absolute value, 
the absolute value of the derivative Tj (x) can not exceed nL. 

The proof to be given here was devised independently by 
Marcel Riesz and by de la Vallée Poussin. For the purposes 
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of the demonstration, the theorem may be restated in the 
following equivalent form: 

If Tn(x) is a trigonometric sum of the nth order, and if 
the maximum of | Ta (x)| is 1, the maximum of | Tala) | itself 
can not be less than 1/n. 

Suppose the maximum of |7,(7)| were less than 1/n. 
Then, for any value of the constant C, the function 


Ry (x) = 4 cos (nz — C) — Ta (x) 


would have the same sign as cos(ma—C) at each of the 
points C/n, (C-+-2)/n, (C+22)/n, ---, (C/n)+2m, where 
cos (na—C) takes on the values +1 and —1 alternately. 
Hence R(x) would vanish at least once in each of the 
2n intervals between successive points of this set, say at 
the points Tı, £ə.---, Xon, all contained in an interval of 
length 27. By Rolle’s theorem, R(x) would vanish between 
wv, and rs, between zs and zg, etc., and also, as R,(a,+27) 

Ry (a) = 0, between za, and 2,+ 22; and the 2» distinct 
roots of Ry thus specified all lie within the interval from 
a, to +22. Explicitly, 


Ril) = — sin (nz —C)—Ti(2). 


If C is chosen so as to make sin(ma—C) equal to the 
negative of T(x) at a point where Ty (x) = +1, Ri (x) will 
have a double root at this point; for each of the functions 
sin(na—C), Ta (x) separately attains a maximum or a minimum 
there. Having 2n distinct roots in a period, and at least 
one double root, R, has roots of aggregate multiplicity at 
least 2n-+-1. But this would require that 24, as a trigonometric 
sum of order n, vanish identically, which is impossible, since 
FR, is sometimes positive and sometimes negative, by hypothesis, 
and so can not be constant. Consequently the assumption 
that | Ta (x)| remains less than 1/m is inadmissible. 

(The fact that a trigonometric sum of order n can not 
have roots of multiplicities aggregating more than 2” without 
vanishing identically is assumed here as known. A special 


6 
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case of it, sufficient for the present application, is proved 
incidentally in another connection in Chapter IV below.) 

To proceed with the proof of convergence, let f(x) be 
a continuous function of period 27, and S,(x) the partial 
sum of its Fourier series to terms of order n, and let 


TK ss die [f() —= Sn (E) a dz. 


Let tan(x) be a trigonometric sum of the nth order, sub- 
sequently to be chosen so as to furnish an approximation 
to f(x), but arbitrary for the present, and let 


f@—t(z) = r(x), Sq (a) — tn (x) = ty(x). 
Then *+—vr, is identical with f— Sn, and 
we 
site [r (x) ENEN (9) i dx = Yn. 
Let €» be an upper bound for the absolute value of r(a): 
r(x) = |f@—tr@)| £ En 


for all values of x. Finally, let øn be the maximum of 
‘tn (x)|, and a a point where | t,(a)! = pn. 
By Bernstein’s theorem. 


| Ta (2) | < N tin 


for all values of z. For |a—a |< 1/(2n), consequently, 
' 1 1 
| ty (£)— Tn (Xo) | < g Mn | tn (x) | = > Mn 


If £n < }m (the contrary case being reserved for separate 


mention), 


1 ` 1 
Ir@æ)| < 4h |r (£)— Ta (x)| => 4h 


throughout the interval specified. As the length of the 
interval is 1/n, 
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, ay toate \12 te > LL (Hn\° 
in fr (x) t(x)] da = (£) ` 


whence un < 4(nyn)?. To suppose, on the other hand, that 
én > tn, is to assume outright that «,<4¢«,. So in either 
case un does not exceed the larger of the quantities 4¢,. 


4(nyn)"?, or, in a single formula, 
fn ae 4(nyn)? + 4é,. 


But |tn| < fn, |r| < En, and r— tn is the same as f'— Sn. 
Consequently 

fŒ) — Sn (x) < 4 (n Yn)? + 5é, 
for all values of r. 

The function f(x) being continuous, it is always possible 
to choose tan(x) for successive values of n in such a way 
that lim,—«. én = 0. So the relation just obtained amounts 
to a proof of 

THEOREM IT. Jf f(x) is a continuous function of period 27, 
its Fourier series will converge uniformly to the value f(x), 
provided that 

lim nyn = 0. 


n= 
In particular, Theorem VI of Chapter II yields at once 
COROLLARY I. A sufficient condition for uniform convergence 
is that f(x) be absolutely continuous. 
By Theorem II of Chapter I, on the other hand, if œ (ð) 
is the modulus of continuity of f(s), there will exist sums 
ta (x) such that 


T(x) — tar (1) | < K'o (27/n), 


where K’ is an absolute constant. Then 


fire —t, (a)? da < 27 [K'a (2a/n)]?. 


But as the integral takes on its minimum value when Sn (æ) 

is put in place of ¢,(x), the right-hand member of the last 

relation is an upper bound for yn. So ny, will approach 

zero if n!’ w(27/n) approaches zero, and it is possible to state 
ee 
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CoroLLARY II. A sufficient condition for uniform convergence 
is that limg_9@(0)/d"? = 0, where w(0) is the modulus of 
continuity of f(x). 

The conditions of Corollaries I and II of course overlap 
to a considerable extent, but they are not coextensive, and 
neither includes the other. As previously mentioned, they 
are less general than others which have already been obtained; 
absolute continuity is more stringent than limited variation, 
and the requirement involving 6? is more restrictive than 
the Lipschitz-Dini condition. 

The novel possibilities of the present method will be more 
apparent, if it is remarked that in the proof of Theorem I, 
down through the final relation of inequality preceding the 
statement of the theorem, no use whatever was made of the 
assumption that S, (2) was the Fourier sum for f(x), and 
the value of the integral y, a minimum; the argument can 
be repeated step by step with Sn (x) replaced by Ty (x), an 
arbitrary trigonometric sum of the nth order, and y» replaced 
by gu, the integral of [f(<)— Tn (x)]* over a period. The 
conclusion is 

THEOREM Ila. Jf f(x) is a continuous function of period 
2a, Ta (x) an arbitrary trigonometric sum of the nth order, 
and 


Gn =|" row- Tn (a)l dx, 


and if there exists a trigonometric sum tn (x), of the nth order, 
such that 
f(x) —ty (x)| = En 


for all values of x, then, for all values of x, 
(av) — Tr (£)| < 4 (nga) + Sen. 


To this may be added immediately 

THEOREM IIb. Jf sums Ty (x) are defined for infinitely 
many values of n in such a way that gn < Ayn, where A is 
independent of n, the sums Tn (x) will converge uniformly to 
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the value f(x), as n becomes infinite, under the conditions of 
Theorem II or either of its corollaries. 

The next application depends for its simplest expression on 
an existence theorem, the proof of which will be postponed 
for a few pages, to avoid interruption of the current order 
of ideas. The statement of the theorem is as follows: 

THEOREM III. Jf ọ(x) is a summable function of period 2a 
which is nowhere negative, and is different from zero over 
a point set of positive measure in a period, if f(x) is a sum- 
mable function of period 2a such that o(x)[f(x)]* is also 
summable, and if T, (a) is a trigonometric sum of specified 
order n, the integral : 


f 8 LS) Tr @)P dx 


has a minimum, which is attained for one and just one deter- 
mination of the coefficients in Ty (a). 

The truth of this assertion being assumed for the moment, 
attention will be directed to the question of the convergence 
of Ta (x) toward f(s) as n becomes infinite, the function f(a) 
being supposed continuous. The problem is connected with 
an extensive theory developed from a different point of view 
by Stekloff, Szegé, and others. The minimizing sum may be 
called the approximating sum of order n corresponding to the 
weight function ọ(x), and it will be understood now that 
T,,(x) denotes this particular sum. The corresponding minimum 
value of the integral will be denoted by G,. For the con- 
vergence proof, let o(a) be further restricted by the hypo- 
thesis that its values are comprised between two positive 


bounds: 
0<v < olz) <T, 


where v and V are independent of x. 
By the minimizing property of T, (a), 


Gh =f eOU@— Trae 


< J" e@) Le) S ade < Vrn 
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where Sn (x) and yn have their previous meanings. On the 
other hand, with the notation of Theorem IIa, 


rd 
Ga = vf [E Ty (aw)? da = vgn. 


So gn < (V/v)yn, and the conclusions of Theorem IIb are 
applicable: 

THEOREM IV, Jf the weight function o(a) has a finite upper 
hound and a positive lower bound, the approximating sum T(x) 
will converge uniformly toward f(x) as n becomes infinite, if 
lim, —« nyn = 0, and hence, in particular, if f(a) is absolutely 
continuous, or has a modulus of continuity (0) such that 
limy—,  (0)/d"? = 0. 

In modification of this result, let the restriction lọ (x) < V 
be removed, the hypothesis with regard to ọ being merely 
that it is summable, and that ọ > v > 0 everywhere. Since Gh 
is a minimum, 


Gy < ibe olx) [Lf (x) Tor ty (x)]? da 


if t (x) is any trigonometric sum of the nth order. Let €n, 
be an upper bound of |f— #,|. as before. Then 


Gz e f eeds = Je, 


with the notation fe = 1. As it is still true that Gn => vgn, 
it follows that ga < (//v)&. Under the hypothesis that 


limy—,  (d)/d"* — 0, t,(x) can be chosen so that lim, —æ n"? €n 
= 0, which means that liMmn=% ngn = 0, and Theorem Ia 
shows that Tn (x) converges uniformly to f(x) once more. 


2. Convergence of trigonometric approximation as 
related to integral of mth power of error 


To generalize in another direction, consider the integral 


ifr =? (x) | f(a) — Th (x) |" dx, 


in which m is an arbitrary positive number, taking the place 
of the particular exponent m = 2 that has been used hitherto. 
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When m, n, o(x%), and f(x) are given, f(x) being bounded 
and measurable, it can be shown that there exists a deter- 
mination of the coefficients in Ta (Œ) for which the value of 
the integral is a minimum, provided that ọ(x) is summable, 
everywhere positive or zero, and different from zero over 
a point set of positive measure in a period. When m>1, 
the determination is unique. The existing presentations do 
not give the theorem with quite this degree of generality, 
but it is not difficult to supply the necessary extensions. For 
somewhat less general functions ọ (in particular, for the case 
that ọ = 1 identically) it has been shown that the deter- 
mination is unique for m = 1 also, if f(x) is continuous. 
It is not generally unique when m<_1. The proofs will be 
omitted here, the facts with regard to convergence being 
formulated in such a way as not to presuppose a knowledge 
of the theorems of existence and uniqueness. 

For the discussion of convergence, let f(x) be a continuous 
function of period 2, and let it be supposed that there is 
a number v such that 


ola) > v>0 
for all values of x, while ọ (x), as always, is summable over 
a period. Let 
r u e 
dn e F (x) if (x) = Ti (x) m dx, 


Ta (a) being for the present an arbitrary trigonometric sum 
of the nth order. (The notation is changed somewhat from 
that previously used in treating the case m = 2.) Let it be 
assumed that there is a sum ¢, (x), likewise of the nth order, 
such that 


ST (x) — ty (x) < En 
for all values of x, and let 
fæ) — ty (x) == f (x) ; Ta (2) — ta (x) gop (x) , 


whence 


T o (x) | r (£) — Tn (x) |" dx = gn. 
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If un = | ta (£o) | is the maximum of | 7, (æ) |, it follows that 
| tn (£)| < nun, by Bernstein’s theorem once more, and there 
is an interval of length at least 1/n throughout which 


| 1 1 
| Tp (x) — Thr (xo) | < g An Tn (a) | = 9 Hn, 


r (2) — ta (@) | > -F tn, 


provided that n < }m,. Then 


v [Mn \™ iim 
In = ry FAR : ln S 4 (ngn/v) . 


Whether en < } yn or not, 


ln = 4A(ngn/v)'™ +4 &,, 

and 

f(a) —Tn (x) | = |r (E) — ta (£) | < 40 gn/v)/™4+ 5 En. 

For each value of n, let yn be the greatest lower bound 
of the integral gn, when all possible values are given to the 
coefficients in Tp (x), while the functions ọ and f and the 
exponent m are held fast. It is clear that 7, > 0. Nothing 
is assumed as to the possibility of making g» actually equal 
to yn. by one or more determinations of the coefficients. 
As gn can not be less than yn» for any sum of the nth order, 
it follows in particular, since ¢, (7) is such a sum, that 


Yn S fre (x) f(x) — tnr (x) |" da < Le, 


the value of fe being denoted by Z. If sums 7), (x) are 
chosen for the successive values of n so that gn < Ayn, where 
A is independent of n, then gi” < (AJ)'"e,, and 


O=] = Bn'2,, 


where B is likewise independent of n. The sums 7), (x) will 
converge uniformly toward f(x), if n!e, can be made to 
approach zero, and this will ‘be possible under conditions 
indicated by Theorem IJ of Chapter I, if m>1; by the 
Corollary of Theorem IV of that chapter, if m = 1, and by 
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Theorem IV itself, if 0<m<1. The principal results may 
be summarized in the following formal statement: 

THEOREM V. Jf the weight function ọ(x) is summable and 
has a positive lower bound, and if a sum Tn (€x) of the nth 
order is chosen for each of an infinite set of values of n so 
that gn € Ayn, where A is independent of n, these sums will 
converge uniformly toward f(a), when m is held fast and n is 
allowed to become infinite, if m>1 and f(x) has a modulus 
of continuity (0) such that limg—9o(d)/d’" = 0, or if 
m= 1 and f(x) has a continuous derivative. 

The explicit statement for m<1 is more complicated and 
less interesting. In the cases covered by the theorem as 
stated, for m => 1, there is in fact a determinate approximating 
sum for each value of n, making gn = yn, and the conclusions 
of course apply in particular to the convergence of these 
approximating sums. On the other hand, it is clear that A 
can be replaced by a factor An which becomes infinite with 7, 
if more restrictive hypotheses are placed on f(x), so that ên 
can be made to approach zero with sufficiently increased 
rapidity. 


3. Proof of an existence theorem 


To carry out an intention expressed above, it remains to 
supply a proof of Theorem III, with regard to the existence 
of an approximating sum corresponding to a given weight 
function o(x) when m = 2. 

It will be worth while to begin with more general con- 
siderations. Let go(x), qı (x), qe (£), «++, be a sequence of 
functions, each summable together with its square over an 
interval (a, b). Let these functions be linearly independent, 
in the sense that any linear combination of a finite number 
of them, with coefficients not all zero, is different from zero 
over a point set of positive measure in (a, b), which is equivalent 
to saying that the integral of the square of such a linear 
combination, extended from a to b, is positive. The requirement 
may be indicated briefly by saying that the q’s are properly 
independent. Let 
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Q = i N Qo 


and generally 


ka Í ak Qj 
Ok = dk- Pa Qj - ; 


it is understood that each integration is to be performed 
with regard to x, from a to b. Each denominator is different 
from zero, because of the hypothesis of proper independence. 


It is seen from the definition of Q, that fQ Q, = 0. Since this 
is the case, the definition of Qə makes f Q Q; = fa On. =; 


Generally, each Q is orthogonal to all the preceding Q’s, or, 
in other words, any two of the Q’s are orthogonal to each 


other. If a 
mæ = affa)”. 


the functions px form a normalized orthogonal sequence, to 
which Theorem I is applicable. Each px is a linear combina- 
tion of qo, +-+, qk, and conversely each gq, can be linearly 
expressed in terms of py and the p’s of lower order, since 
the coefficient of qx in the expression for px is not zero. 

Now let ọ(x) be a non-negative summable function of 
period 27, such that 


us 
É -2 (@)dx > 0, 
and let [ọ (x) 2 w(x). The functions 
w(x), wlx)cosx, w(x)sinx, w(x)cos2æx, w(x) sin2z, --- 


are summable, with their squares, and properly independent; 
a linear combination of a finite number of them, with co- 


www.rcin.org. pl 


III. PRINCIPLE OF LEAST SQUARES 91 


efficients not all zero, is a trigonometric sum multiplied by 
w(x), and is different from zero wherever ọ + 0, with the 
exception of a finite number of points at most in a period. 
If they are taken as the functions q, qi. q2, ---, of the 
preceding paragraph, over the interval (—a, 7), the corre- 
sponding functions po, Pı, Pe, ---. have the form 


wx) Ula), wx) Uy (a). wa) Vila), w(x) Ue (er), 
wx) Va læ), =+; : 


where Ux and V; are trigonometric sums of the kth order, 
and 


Few Uj(a) Ukl) d£ = few Via Vilaedda =O (j +h, 
few Uj (x) Vr(x) da = 0 for all 7 and k, 
oz ar 
fre (x) [Ux(@) dx = few PEOP dx = 1. 


It 7, (x) is any trigonometric sum of the nth order, w(x) Ta (x) 
is a linear combination of the functions now taken as the q’s, 
and so is a linear combination of the functions w(x) U;(a), 
w(x) Ve(£), k= 0,1,---,, and Tn itself is expressible in 
terms of Ux, Vi, with the same coefficients. 

Let f(x) be a summable function of period 27, such that 
o(x) [f(x)]* is also summable. Then 


n 
few [LF Tr) de = F wora — w (a) T da, 


and Theorem I can be applied, with w(x) f(x) as the function 
for which an approximation is sought. The integral has 
a minimum value, and if 
Tu(x) = Ag Uy (x) + Ar U1 (2) + -++ + An Un (x) 
-t B, Vi (x) -+ LAA + Bn Vn (a), 


it is necessary and sufficient for the attainment of the minimum 
that 
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Ák -=f w(x) f(æ) - w (£) Ur (£) da = few Jle) Urle) dx, 
Br = Few T(x) Vr(x) dz. 


4. Polynomial approximation 


Attention will next be directed to problems of polynomial 
approximation. An essential instrument here is the form of 
Bernstein’s theorem in which Bernstein himself was primarily 
interested: 

BERNSTEIN’S THEOREM FOR POLYNOMIALS. Jf Py (a) is 
a polynomial of the nth degree, and L the maximum of 
| Pa (x)| for —1 <x <1, then 
4 ; nl 
| Py (æ) | = (l oT 
Sfor-—1<xr<l. 

It can be deduced immediately from the theorem for the 
trigonometric case. Let x —cosé@. Then P,(x) = Py (cos 8) 
is a trigonometric sum of the nth order in 6, having L for 
the maximum of its absolute value, and consequently 


a a P 
FE P, (cos 0) | = |sin @ Px (cos 6)! = | (1 — x°)? PI (| < nL, 
whence Px(a)| < n L|(1— 22). 

If L is the maximum of | Pa (œ)| for a < x < b, let 
é 2r—a—b 
ig b—a 
Then P, (a) = Qn{y) is a polynomial of the nth degree in y, 
having L for the maximum of its absolute value in the 
interval —1<y<1. Therefore 


nl 


dx | be a nb. 
(1— yr?’ 


=e Pa (x)| < 


d “he 
| iy Qn (y)| = |B: i z 


KY 
by substitution of the value of y in terms of z in the factors 
(1—y), (1+y) under the radical sign, one is led to 

COROLLARY I. Zf P(x) is a polynomial of the nth degree, 
and L the maximum of P,(x) for a< s< hb, then 
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nL 


Palæ) | < = -de-a 


for a<a<b. 


This implies directly 

Corouuary Il. If P(x) is a polynomial of the nth degree, 
and L the PS of (ROV Jor a<a<b, and if 
“<= a, <b <b, then 


Ph(x)\ < hinL 


for a< s< b, where h depends only ona, m, b, and b. 


A further corollary is to be obtained by the following 
considerations. Let P(x) once more be a polynomial of 
the nth degree, and L the maximum of its absolute value 
for —1 <a2< 1. Let x be a number of the interval 
—1l<a2< 1, and ô a positive number <1 such that 
—1<a—d<1. Then 


Pa (x) E Pr (x ra 0) == Es Pn (t) ats n a (1 Ai D ada Re 


It is readily seen that 


R > dt = ih dt 
P NR (1 — £) y; (1— £)? . 


(For an analytical proof, let ð = 27, e—4d = y, so that 
a—d =y—y, £=y +h, 


JE dt iPS a 7 7 at 
rg (L— FP)? y — Py ‘ 
Since 


iy RE er) EIEN 
dy Jy-, U— E) fi— y+] [1 — y — 


which is equal to 0 if y = 0, positive if y>0, negative if 
y< 0, the value of the integral is a minimum when y = 0, 
that is, when the interval of integration is symmetric about 
the origin, and increases steadily as the interval is displaced 
toward either side.) Asô<1, O<Ł<1 in the integral 
from 1—d to 1, 1—# 2>1—t=0, and 
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a cL A i GEEA E 9 2 
Ji-¢ I— 8)? = J, A) y 


Pa(x) — Py (a —8)| < 2n LH, 


So 


An equivalent statement is that if 7, and zə are two numbers 
in (—1, 1), differing by not more than unity, 
Pa (x2) — Pu (a,)| < 2n L| ta — a |". 
Now suppose that ZL is the maximum of | P, (x) for 
a= x< b, and let x, and z be two numbers of this interval, dif- 
fering by not more than 3 (b— a). Let y = (22—a—b)/(b—a), 
the values of y corresponding to 7, and z being yı and ys, 
and let P,(x) = Qn(y). Then L is the maximum of lQn (y) 
for —1<y<1, yı and ys are two numbers of this interval, 
differing by not more than unity, and 


Pn (a2) — Pa (21)| = Qn(y2) — Qn (ys) << 2n L | yo—y |"? 
= 2n L [2 | za— a |/(b—a)]"?. 
The conclusion is 
CorRoLLARY II. Zf Py (x) is a polynomial of the nth degree, 
and L the maximum of | Py(x)\ for a< x< b, and if a, 
and x, are two numbers in the interval, differing by not more 
than 4(b—a), then 
Pp (a2) — Pa (%1) | < Hn L| xre— a, |°, 


where H = 2/[} (b — a)}!. 
By the introduction of an intermediate value rs = $ (æ, + zə), 
it may be seen that 


4n L : 
| Paes) — Pala) | < g pT | 


without the restriction that |z» — a| < 4(b—a). 
The way has now been prepared for a discussion of 
problems of approximation associated with the integral 


Jie Sa Pal) dz, 
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where ọ(x) and f(x) are given functions in the interval 
(a, b), Pa(z) is a polynomial of the mth degree, and m is 
a given exponent > 0. 

When m = 2, a theorem of existence and uniqueness may 
be stated as follows: 

THEOREM VI. Jf ọ(x) is a summable function which is 
nowhere negative. and is different from zero over a point set 
of positive measure in the interval (a, b), if f(x) is a summable 
function such that o(x) [f(a)]*? is also summable from a to b, 
and if P,(x) is a polynomial of specified degree n, the integral 


fe) Lea) — Pall de 


has a minimum, which is attained for one and just one de- 
termination of the coefficients in Py (x). 
The proof is entirely analogous to that of Theorem HNI. 
The functions 
w(x), xw(x), wlz), ---, 


where w(x) = [e(x)]'", are summable together with their 
squares and properly independent over the interval (a, b). 
If they are taken as the functions q(x) previously considered, 
the corresponding functions p(x) have the form w(x) Y;.(a). 
where Y;(~) is a polynomial of the kth degree, and 


fre) Yj (x) Y¥x(x) dx = 0 (j + k), 
feo [F(a] de = 1. 


The polynomials Yx(x) are called the Tchebychef polynomials 
corresponding to the characteristic function o(x) in the inter- 
val (a, b). Any polynomial P, (Œ), of the nth degree, can be 
expressed as a linear combination of Yo, Fi, ---, Yn, and 
for the minimizing of the integral in the statement of the 
theorem it is necessary and sufficient that the coefficient of 
Yx in this expression be 


few se) Yx(x) dz. 
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In the case that ọ is identically equal to 1 and the interval 
is that from —1 to +1, Yz(x) is equal to (t+ $)” Xi; (a), 
where Xj, is the Legendre polynomial of the kth degree; and 
the approximating polynomial P,,(7) then is the partial sum 
of the Legendre series for f(x). 

For an arbitrary value of m > 0, theorems of existence 
and uniqueness can again be established in analogy with those 
of the trigonometric case. An approximating polynomial, 
reducing the integral to a minimum, exists if f(a) is bounded 
and measurable, and if ọ(x) is summable, nowhere negative, 
and positive over a point set of positive measure in (a, b). 
The approximating polynomial is unique when m>1, and, 
with suitably restricted hypotheses, when m= 1 also. The 
proofs will not be given here, the further discussion being 
put in such a form that a knowledge of them is not essential. 

Let f(x) be a continuous function for a < £ < b, and P,(a) 
an arbitrary polynomial of the mth degree. Let g(x) be 
summable over (a, b), and nowhere negative in the interval, 
and let o (x) > v > 0 for æo < x < Bo, where a < æo < By <b. 
(It will be noticed, as a departure from the conditions of 
the earlier convergence proofs, that the hypothesis o > »>0 
does not necessarily hold over the entire range of variation 
of x.) Let 


b 
In = f o (x) | f (£) — Pr (x) I aa. 
Furthermore, let pn(x) be a polynomial of the mth degree, 
in general distinct from P, (x), 
F(x) — pa (£) 
for a<zr< b, and 
S (x) —pn(v) = r(x), = Pax) — pale) = mo), 
so that 


a 
= En 


fe) remna de = gu. 


Suppose that un is the maximum of | 7, (a)! for @ < x < Ao. 
and that 2 is a point in (œo, £o) at which this maximum is 
attained. By Corollary III above, 
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ty (£) — Tn (%)| < Hn pn x — zo! 


as long as x is in (@, £o) and distant from 2» by not more than 
$ (Bo — ap), the factor H being equal to 2/[4 (8) —«)]'*. These 
conditions will be satisfied for n = 1 throughout one or the 
other (or both) of the intervals (a — 0, a»), (£o, £o + ò), if 


ò — = Si = Bo— &o 
4H* n? .32 n” 


Over such an interval, then, since |x — zo"? < 1/2 Hn), 


1 1 
Tn (x) — Tn (a) | < p Mns Tn (x) 2 g Mane 


If En < flin, 


(0) — an (a) > Eun 


over the same range; and, as the length of the interval is 
1/(4 H° n’), 
v u m 
gu = 4H? (c) A 
lin < 4(4 H? jy (n? gn)! ™ = Bo (n? gn)!™, 


with Bo = 4(4H*/v)!™. Whether en < un or not, 


lin < Boln? gn)! + 4en, 
F(a) — Pa (@)| = |r (£) — an (x)! < Bo (n? gn)! dey 
throughout («&o, £o). 
Let yn be the greatest lower bound of ga, when ọ (2x), f(x), 


and m are given. As an immediate consequence of the 
definition, 


TES few fomno ar S T T= f o(x) dx. 


If polynomials P, (x) are chosen for an infinite succession of 
values of n so that gn < Ayn, where A is independent of n, 
then gi < (AT) ™ e, and 


f(e) = Pa (æ)! < Bn?™ Ens 
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for «o < xz < ĝo, with a factor B which is likewise inde- 
pendent of n. 

The conditions to be imposed on f(x), in order that it may 
be possible to make n?” ¢, approach zero, are to be obtained 
from Theorems VI and VJI of Chapter I, and the Corollary 
of the latter. It will be sufficient to state the results form- 
ally for m > 2, though corresponding statements could readily 
be added for O<m<2: 

THEOREM VII. Jf o(x) is summable over (a,b), and has 
a positive lower bound for œo < x < Bo, where a= «o< Bo 
=< b, and if a polynomial P, (x) is chosen for each of an 
infinite set of values of n so that gn <= Ayn, where A is inde- 
pendent of n, these polynomials will converge uniformly toward 
f(e) for @ < £= Bo, when m is held fast and n is allowed 
to become infinite, if m>2 and f(x) has a modulus of con- 
tinuity w(0) such that limg— @(0)/0?" = 0, or if m = 2 
and f(a) has a continuous derivative. 

Under the conditions formulated, there will in fact be one 
and just one polynomial of each degree n for which gn = Yn, 
not merely for m = 2, in accordance with Theorem VI, but 
also for an arbitrary m>2, under a theorem cited above 
without proof. Theorem VII naturally applies to the con- 
vergence of these approximating polynomials. More particular- 
ly still, for ọ(x) = 1, a = —1, b = 1, one may state 

CoroLLARY I. Lf f(x) has a continuous derivative for 
—1<x2=1, tts Legendre series converges uniformly toward 
J (x) throughout the closed interval. 

Another part of the content of the theorem, aside from the 
question of uniformity of convergence, is expressed in 

Corouuary I. Jf ọ(x) is continuous and nowhere negative 
Jor a< x< b, the hypotheses of Theorem VIL remaining 
otherwise unchanged, the polynomials Pa (a) will converge toward 
J (x) at all points where o(x) + 0. 

Additional information can be gained by further elaboration 
of the proof of the theorem. Let the previous notation be 
continued in force, with the understanding now that poly- 
nomials P(x) and p,(x) are defined for all positive integral 
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values of n, and that these sequences of polynomials are 
kept unchanged throughout the course of the reasoning. Let 
« and 8 be two numbers such that o< œ << o. Let 
Či, Œa, +++ and i, 2, --- be two infinite successions of 
numbers such that 


<a Og < +s Ca, Bmwer CPe< Bi < fy. 


Let vn be the maximum of ,(x) for «œ < x <, and tnk 
the maximum of the same function for a, < £ < Bx; fno is 
the same as the previous mn. It follows immediately from 
the definitions that 


lno = Hm = Unm Z +--+ Z Vna: 


Until the contrary case is explicitly mentioned, it will be 
assumed that en < ł}vn. Then, a fortiori, €n < uno, and 
tno < Bo(n? gn)’, by the earlier work. If gn < Ayn, as will 


be assumed from now on, g}™ does not exceed a constant 
multiple of £n, and 
' Mno < Ag n7im Ens 


where Ao is independent of n. 
Suppose it is known, for a specified value of k, that 


Mak < Ak mn? En, 


Ax being independent of x. The exponent ø will be denoted 
by ox when there is occasion to emphasize its dependence 
on k. By Corollary II of Bernstein's theorem, 


nn (£)| < hn hnk 


for ar41 < £L Bess, the factor h depending on ær, By, &r+1, and 
Brk+1, but not on anything else. Let æ, be a point in (@k+1, Besa) 
such that |7(2,)) = fnjeja, and let ð = Hn,k+/ (2 hM lnx). 
Because of the fact that pn, k+1 < fur, Ò <1/(2hn), and it 
is certain that at least one of the intervals (a, — ð, , 2), 
(æ, 2+ 4,) consists entirely of points belonging to (ær+1, B41), 
as soon as x is sufficiently large. This condition being satisfied, 


“+ 
‘ 
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1 1 
n(x) — Tn (a4) < ə Mn, k41 Tin (æ)| 2 p n, k+l, 


throughout the interval designated, and as it has been supposed 


that En A tin < I Btn, ki; 


v prt 


A , 1 ln, ka |” n, k+1 
r(x) — nn (£) 2 4 HMakt+is Jn 2 vd; (e) = 4a”. 2hnpnn’ 


Mn, k41 < Brn (Ngn feng FD, 


where Bx; is independent of n. Taken with the relations 
Qn E ATEM, wy, SA, N E, this means that 


< Bai (A A, Tn® H Tie) mbar 


Ma k4 = 
or, if Beir (A AyD)" is denoted by Axis, 


fn, kti < Appr NOTVIMTD f,,, 


Because of a qualification introduced at one stage in the 
work, the result has been obtained in the first instance only 
for values of n from a certain point on; but it can be extended 
to the finite number of values of n previously neglected, from 
n = 1 on, by a suitable adjustment of the value of Axi. 
It is recognized therefore that the successive numbers n have 
upper bounds involving a sequence of exponents ox, beginning 
with œ = 2/m, and so related that 0.41 = (6%-+1)/(m-+-1), 
It may readily be verified by induction that 


1 1 
io T m(m+1¥ * 


If is any positive number, it will be possible to choose 
a value of k for which ox<(1/m)+ 7. With such a value 
of k, 

Vn < hnk < Apn En» 


All this, it may now be recalled, is on the assumption that 
én < ra. But in any case 
Mn < 4 enp Ayn en, 


The last relation, by the definitions of 7, and en, implies 
that 
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SI (x) — Pa (x) | = |r (x) — n (x) | < 5 ent Ay n+ en 
for «<x <. More concisely, since 1 < n™* forn > 1, 


| f@) — Pa (a) | < Cn en, 


if C = Ak+ 5, the factor C, like those which have preceded 
it in similar situations, being independent of n. The previous 
results with regard to convergence may be supplemented by 
stating: 

THEOREM VII. Jf @(%) and P, (x) satisfy the hypotheses 
of Theorem VII, and if «o< @ < B< By, the polynomials Pa (x) 
will converge uniformly toward f (x) for a<r < B, ifm>|1, 
and if a positive number ņ exists such that 


lim w (6)/d¢™+7 — 0, 
Jd=0 


where œ (ò) is the modulus of continuity of f(a). 

COROLLARY. If @ (x) is continuous and nowhere negative for 
a< x< b, the hypotheses with regard to P,(x) and f(x) 
remaining unchanged, the polynomials Py (x) will converge 
toward f(x) at all points where oe (x) +0. 

For points in the interior of (a, b), this corollary super- 
sedes the second corollary of Theorem VII, being a direct 
generalization of it. 

There are analogous but less simple conclusions for 
0<m <1. 

Generalizations of Bernstein’s theorem (in appropriately 
modified form), leading to the extension of parts of the pre- 
ceding analysis to certain cases of development in terms of 
characteristic functions of differential systems, have been 
given by Miss Carlson (Transactions of the American Mathe- 
matical Society, vol. 26 (1924), pp. 230-240, and vol. 28 (1926). 
pp. 435-447). 

5. Polynomial approximation over an infinite 
interval 


Another extension of the scope of the method lies in its 
application to problems of polynomial approximation over an 
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infinite interval. To be specific, let f(x) be a function defined 
and continuous for all real values of x, and not greater in 
absolute value than a constant multiple of a power of x, as 
x becomes infinite in either direction. Among all polynomials 
of specified degree n, let Pa (x) be the one which minimizes 
the integral 


D &* (LF @)— Pa@F de. 


The proof of the existence of the minimum and the uniqueness 
of the corresponding polynomial offers no new difficulties in 
comparison with cases already treated, and will be omitted. (It 
can be based either on the construction of a sequence of orthog- 
onal polynomials over the infinite interval with the weight- 
functione 7", essentially the polynomials of Tchebychef-Hermite, 
or on the fundamental theorems of real analysis. From the 
latter point of view, the essential point in the passage to the 
infinite interval is merely that if any coefficient in P, (a) were 
large, the integral over a finite interval would be large, and the 
integral over the infinite interval would be larger still, so that 
the coefficients to be considered in the search for a minimum 
belong to a bounded domain.) It is to be shown that under 
suitable restrictions on f(z) the polynomial P,, (a) will converge 
everywhere to f(x) as n becomes infinite, and will converge 
uniformly over any interval of finite extent. The aim will 
be to arrive at a result of this character in as straight- 
forward a manner as possible, without regard for the greatest 
attainable generality. 

Let yn be the minimum value of the integral, and let « 
be an arbitrary preassigned positive number. Let p, (x) be 
an arbitrary polynomial of the nth degree, let 


Ta (ax) = Í (x) — Mlz), Mn (x) = Palæ) = Pn (x), 
so that 
JS (x) —Pn (x) = rn(x)— nn (x), 
and let ¢, be the maximum of | 7, (x)| for —(e+1)=x2<e+1. 
Let un be the maximum of |m,(z)| in the same interval, 
_ attained for + = x. 
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By the third corollary attached to Bernstein’s theorem on 
the derivative of a polynomial, there is a positive number H, 
independent of n, such that 


| n (x) — Tn (xo) < Hn tin |£ — to 2 


as long as æ is in that one of the intervals (—e —1, 0), 
(0, @+1). to which 2» belongs. This condition will be 
satisfied, on one side of z at least, for all values of n from 
a certain point on, if #—a <1/(4H?n?). So there will 
be an interval of length 1/(4H*n*) throughout which 

' 1 1 

| En (£) — rn (To) | < p Mm | æn (x) | 2 > tn: 
If it is supposed that n < }y,, then 
k al 
J (@)—Pn (æ) = |In (x) — Tn (x) ea 4 bn 
throughout the same interval, whence it follows that 


Yn > ee. Pn P ae = 
= 16 4H*n* 

where ¢& is independent of n. The alternative possibility 

that €n >t", may be left out of account until a later stage. 

Now let uh be the maximum of |, (x)| for —e<a2<e, 

It is certain that pn < un. Let zı be a point in (—e, «) 

such that |7,(a1)| = un. By the second corollary of Bern- 

stein’s theorem for polynomials, there is an % independent 
of n such that 


> 1/2 
Mn =i CNY ns 


Mnl) L Antin, |Ana) — anla) | < An un |£ — a l, 


for —« < x<a«. (It is clear that % must be positive.) The 
quantity n = uh / (2h nun) is less than æ for values of n from 
a certain point on, since un < un. One or the other of 
the intervals (xı — Ön, 21), (a1, 21+4,) is then contained in 
(—«, «), and there is an interval of length at least 0, through- 
out which 


1 re 
Tn (x) — Nn (x) | < > Ma T(x) | > 
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If én < den, 


rts, 
| F(a) — Pr (@)| = | rn(x)— 7n(x)| = 4 lin 
in this interval, and 


12 , 
Yn = € itr T . arcs Ith <A Ca (N fey yn)'®, 
with c independent of n. The assumption that £n < 4 uh carries 
with it the condition £, < }m,, under which u, < onyt, 
so that 


, » n? yd/2 ! < p yd yl 
Nat n Sen 7 2 By SOQ Yn 


If the hypothesis that en < łan is not fulfilled, uh <4¢6, 
immediately. In either case 


' ’ w 
Pa < Cy nas gas T 4e, ’ 


with the reservation merely that a finite number of values 
of n may have had to be ruled out in the course of the proof. 

As un and €» are upper bounds for | 7,(a)) and |r, (x) 
respectively, 


|S (@)— Pal)! = |rn(£)— an (x)| < cn? 7)? Ben 


for —«<r<«. For fixed «, the polynomials p,(2) can 
surely be determined so that liM- ér = 0, since f(x) is 
continuous. Jn order that Pa(x) shall converge toward f(x), 
uniformly throughout the interval (—e, œ), whatever value 
may be assigned to «, it is sufficient that limn—xn'*7n = 0. 
It remains to discuss the order of magnitude of yn, under 
appropriate hypotheses on f(a). 

There will be occasion to use a lemma regarding the 
behavior of a polynomial as its argument becomes infinite. 
Let pa(a) be an arbitrary polynomial of the nth degree, 
n = 1, and let M be an upper bound for its absolute value 
in the interval —1 <a#<1.- Let X(x), as in Chapter I, 
be the Legendre polynomial of degree k. Then p(x) may 
be written identically in the form 


Pn (x) = Ao Xo (7) + Ay Xa (x)+ «+» + An Xn(2), 
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where 


2k+1 (" $ 
Ak = at J me Xi (x) da. 


From the last equation, together with the definition of M and 
the fact that Xx(£)| <1 in (—1, 1), it follows that 


Ay | <(2k-+1) M. 


For « > 1, on the other hand, the identity 


Xi (2) = : f+ (2—1)? cos g]k¥ dg 


gives ' 
(X l| < |2alF, 


since (z?— 1)? cosy, < |æ|. Consequently, under the assump- 
tion still that 2) = 1, 
(pa (w)| < M[1+3|2a\+5/2a/?+ --- + (2n +1)|22l”] 

< M[1+3+5-+ .--+(2n+1)]|22) 

= (n +1)? M\2a" < (n+ n)?M|22\" = 4Mn?|22/". 


If pa(x) is a polynomial of the nth degree, as before, and 
if M is an upper bound for |p, (x)| in an interval — 8 < æ 
< B, where 8 is an arbitrary positive number, then pp (x) 
is at the same time a polynomial of the nth degree in 2/8, 
and for |x| > 2, |a/8| = 1, 


‘yn (x)| < 4Mn®\22/8\", 
This relation, under the hypotheses stated, constitutes the 
lemma in question. 


Let it be assumed now that f(a) has everywhere a first 
. derivative satisfying the condition 


ST" (ws) — f' (a)! < 4 | 2% — t|: 


Let A be an arbitrary positive number, and let Theorem VII 
of Chapter I be applied to the approximate representation 
of f(x) in the interval (— £, 8). The conclusion is that there 
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exists for each n> 1 a polynomial pn (x), of the nth degree, 
such that s 

; 2 K* p*2 

\f (x) — Pn (x) < : =. 


for — <x < 8, where K is the absolute constant that 
figures in Theorem I of the first chapter. 
As an immediate consequence, 


B 2e B 
W- f {7 UG) — rn (fda s ERE fi 
= 4K*pet Í 


=> nt 


ea aay 
—eo 


or, as the value of the last integral is m'*< 2, 


Furthermore, |p, («)| < M,+2K*A°4/n® for —B<2r< B, 
if M, is the maximum of |f(z)| in the interval. Let |,(0)| 
=m, |f'O)| = a; then |f’@)| < atile, fæl 
< aota,\r|+ 4427, and 


M, < m+m8+ y 1B. 


(Incidentally it is seen that the present hypothesis carries 
with it the fulfillment of the earlier requirement that f(x) 
shall not become infinite faster than a power of z.) If 8 > 1, 
as will be assumed henceforth, M, < (a)-+ a,+44)8%, and 


Pn (x)| < ey B® 


for — < x < ĝ, where c, is independent of 8 and of n, 
though it is different for different functions f(z). 
For « = &, by the reasoning of an earlier paragraph, 


AEE A- LAEN 
At the same time (that is, for || > 8 > 1) 


F@)| < wta r|++-Aa? z (++) x, 
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and by further enlargement of the right-hand member, with 
fy = A +a,+424 for abbreviation, 


f(a)| 2 ac? = 1 aa B? (2 L/B << — dy B? n? (2 v/ AF 


IA 
p: Bie 


aBn? | 2 2/8 |"; 


it has already been assumed that n = 
multiplier cz, independent of £ and of n, 


So, with a new 


| f(x) — pn (2) | < < (tat + a)" n®|22/8\", 


[ f(x) — pn (aw)? < cs BA n*| 20/8 |?" 
for |x| = B. 
This means that 


a90 Q\2n p” 
f, e” Lf (2) — pa (x)? dx < cs Btn" (3) J aae? da; 


the quantity on the right is an upper bound also for the 
integral from — æ to —ĝ. Since £21, 


an 1 cra 
[oom eo dz< f; ane tgd = — K y” e! dy 
B fis 
<t f peva =4 r(n+1) = zn! SA nine ait 
2 2 


Without reference to more elaborate approximations to the 
value of the factorial, it is seen at once that 


k+1 k41 
logk = f; log kat < f log tdt, k=1, 2; 
log [(n—1)!] = log1 +log 2 +----+log (n— 1) 
< fogta = n log n—n+ 1, 


(n—1)!<e(n/e)". 


Consequently 
Wig Ee” [J (x) — pn (° dx < = cs eB n? Hp S 
B a P B e 
wir 4 »( 4n\” 
= 3% eB n Be 
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and 
In = es e@ [fF (x) — pn (x)]* dx 


8KS 8*2 4n\" 
a taolin (Ze), 


The number £ has been arbitrary hitherto, except for the 
requirement that it be positive and not less than 1; the last 
relation is valid with the insertion of any such value of £. 
Now let 8 = 2n'?, This makes 
128 K* 2? 

n? 


IA 


Ins 


+ 16 cy en’ ee. 


As n? e” approaches zero when n becomes infinite, n e~" does 
not exceed a constant multiple of 1/n* for n => 2, while, on 
the other hand, yg, is not less than the minimum value yn. 
Therefore 
Yn Yn < Ad : 
= n 
with cs independent of n. 

The condition lim,—.« n*/* yn = O being satisfied, P, (x) con- 
verges uniformly toward f(x) throughout any finite interval, 
under the hypotheses stated. 

As in the case of the Fourier and Legendre series, the method 
under discussion, when applied merely to the classical problem, 
yields little if anything that is new, and misses much that is 
well known. Clearly, however, the treatment admits a variety 
of generalizations, which remain open for further investigation. 
To mention just one, which calls for no additional labor, the 
reasoning applies without material change if the weight func- 
tion e” is replaced by any positive continuous function which 
is never greater than a constant multiple of e~”. The theory 
thus suggested has been developed at some length in a thesis, 
as yet unpublished, by J. M. Earl. Theorems on degree of 
approximation over an infinite interval, without reference to 
the particular method of this chapter, have been published 
by W. E. Milne in vol. 31 of the Transactions of the American 
Mathematical Society. 
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CHAPTER IV 


TRIGONOMETRIC INTERPOLATION 


1. Fundamental formulas of trigonometric 
interpolation 

This chapter is concerned with certain striking analogies, 
both formal and analytical, between the theory of interpolation 
by means of trigonometric sums and that of Fourier series. 
The case of polynomial interpolation will be left out of con- 
sideration for the most part, since the analogies there, when 
the points used for interpolating are equally spaced, are 
rather with Taylor’s series than with those of Fourier and 
Legendre. Certain extensions to the case of interpolation 
by means of Sturm-Liouville sums have been carried through 
by C. M. Jensen (Transactions of the American Mathematical 
Society, vol. 29 (1927), pp. 54-79). 

Let to, ti, -++, fon be 2n +1 distinct numbers contained in 
an interval of length 27, for definiteness that from 0 to 27, 
inclusive of the left-hand and exclusive of the right-hand end 
point. Let Yo, i++; yon be any 2n-+1 real numbers, 
distinct or not. Let the problem be proposed of determining 
a trigonometric sum Th (x), of the nth order, to satisfy the 
conditions 

Tn (tr) = 'UYr5 r = Or 1,<-, Zn: 


The sum Th (x) has 2n-+1 coefficients, on which 2n -+ 1 
conditions are imposed. With the notation 


Ta (£) = do +a,cosa+--- +a, cosne 
+h, sin g+ .-- +h, sin nx, 
' 


the conditions to be satisfied are given explicitly by the 


2n +1 equations 
109 
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ao + COS ty + ly sin fy +..-ta,cosnty -+ basin n to = ‘Yo + 
ilo +a, cost, +h, sind, Shaka h: BACAR -H ened = nw , 


ito + dy COS boy, + bi Sin ton + +» belk An COS N ton 4 S sin” i = Va’ 


which are linear in the a’s and b’s. One is confronted by 
the problem of showing, directly or indirectly, that the deter- 
minant of these equations, the determinant 


1 cost sihi +: cosnty sinnty 
1 cost; sing --- cosné sinnt 
1 COSfon Sinton +++ COS Rte, Sin rho, 
is different from zero. 
If « and 8 are any two numbers, the product 


sin > (a—«) sin ; (a — 8) 


is a trigonometric sum of the first order in x, being identically 
equal to 


= A+ A, COS x of. B, sin T, 


with Ap = 4cos(t4}a—}f), A, = — }ł cos ($a +} A), 
B, = —4sin(4a@+ 3). In the expression 


pr (x) = 
sin 4 (x — to)++-sin $ (x — tk—1) sind (£ — tr) sin $ (x — ton) 
sin 4 (tr— to)-++ sin 4 (tr— tr—1) sin 4 (.— tr+1)--- sin 4 (te— toy) ? 


the 2n factors of the numerator can be combined in pairs 
to give n expressions, each of which is a trigonometric sum 
of the first order, and the product of these is a trigonometric 
sum of the nth order; the denominator is a constant which 
is not zero, since the difference of any two of the numbers z, 
is by hypothesis different from zero and numerically less than 
2a. It is apparent on inspection that gx (¢,) = 0 for r + k, 
while gx (tk) = 1. Consequently the function Tn (x) defined by 
the formula 
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T, @) = Šv Py (£) 
k=0 

is a trigonometric sum of the nth order such that Tn (t) = yr. 
r= Q0, 1,..., 2n. The linear equations above accordingly 
have at least one solution, for any assigned values of the y’s. 
But if the determinant of the equations were zero, there would 
be values of the right-hand members for which there would 
be no solution. Jt is certain therefore that the determinant 
is different from zero. Whatever values are given to the y’s, 
the proposed problem has one and just one solution. In par- 
ticular, if all the y’s are zero, the obvious solution in which 
all the a’s and b’s are zero is the only one. A trigonometric 
sum of the nth order which vanishes at 2 n+ 1 distinct points 
in a period is identically zero; two trigonometric sums of the 
nth order which coincide in value at 2n-+-1 distinct points of 
a period are identically equal. 

It is easy now to supply a proof of a fact which was pre- 
viously assumed as known, in connection with the proof of 
Bernstein’s theorem, namely that a trigonometric sum of the 
nth order vanishes identically if it has 2 n distinct roots in 
a period, one of which is a double root. Let Tn (x) be a sum 
of the nth order which takes on the value O for 2? n distinct 
points of a period, say for s = 4, fe, ---, fn, but which does 
not vanish identically. Let f be a point of the same period 
interval, distinct from 4, ---, tn, and let y = Tn (to); it is 
certain that yo +0, since 7), (x) can not have 2n +1 distinct 
roots in the interval. The expression 


: sin $ (x — 4) sin  ( — t) - - - sin  (@ — tn) 
Yo “Sin 4 (fo — h) sin 4 (to— t) -- - sin J (fo — tan) 


s a trigonometric sum of the nth order which takes on the 
value y for x = tọ, and vanishes for 7 = #,,---, fon, and so 
must be identical with T(x). The derivative of this expression 
is easily calculated explicitly; it is 


n> sin 4(a—¢,)---cos 4 (a — t;,)--- sin 4 (a — ton) 


T! fn) = i 
H Q= næ sin 4(H— ti) -sin $$ (fo— ti) +++ sin $ (f4y— tan) 5 
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For « = ?¢,, if r has any of the values 1,---, 2m, all the 
terms of the summation vanish, except the one for which 
k = r, and that becomes 


p, — SmdG—h)--- +++ Sin $ (tr— tan) 
j sin $ (fo— tı) --- sin $ (to— tr) --- sin }(to— tzn) ` 


which is certainly different from zero, so that Ty (t) =$ yo D, + 0; 
the sum 7),(~), assumed not to vanish identically, can not 
have a double root at any of the points t, ---, fen. 

To return to the problem of interpolation, /et the points ty 
from now on be supposed equally spaced over a period, As 
a matter of general notation, if m is any positive integer 
and r any real integer, positive, negative, or zero, let 


= 2ra/m. 


The discussion hitherto (apart from the digression of the last 
paragraph) is applicable on the assumption that m is odd, 
m = 2n-+1; there will be occasion subsequently to consider 
even values of m as well. Throughout the remainder of this 
chapter, in the absence of express indication to the contrary, 
the sign > will be understood to refer to summation with 
respect to the index r, from r = O to r = m—1, or what 
comes to the same thing, as all the functions considered will 
be of period 27, over any m successive values of r. 

It is a fundamental fact, whether m is odd or even, that 


> sin kt, = 0 
if k is any integer, and that 
> cos kt, = 0 


if k is an integer not divisible by m, while if k is a multiple 
of m (and in particular if k = 0) it is evident that 


a cos kt, = m, 


since in this case each term of the summation is equal to 1. 
The truth of the statement with regard to X sin kt,» is 
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apparent from considerations of symmetry. The term sin k fo is 
equal to 0. Since 
ktm = 2k(m—r)a/m = 2kr—ktr, 


each term corresponding to an index r between O and $m 
is paired with a numerically equal term of opposite sign. 
If m is odd, all the terms are thus accounted for; if m is even, 
the term which remains by itself is sin ~tm; = sin kæ = 0. 

In connection with the cosine sum it is to be recalled that 
sin (n+ $)u 

2singu ` 
(The right-hand member is understood to be defined by con- 
tinuity at points where the denominator vanishes.) If u = kt, 
then ru = kt,. and 

5 +eoskti + coskta t --- +coskt, = aibe, 
But by the symmetry pointed out in the preceding paragraph, 
the terms on the left are respectively the same as those of 
the sum 


L 
S + cosu cos2u+ --- +eosnu = 


1 
5 F cosktma+ cosh lm- ++ +++ cosktn—n- 


If m = 2n-+1, and if k is not divisible by m, addition of 
the two sums gives 
> cos ki Bin(n EPIR 
sin $ kt, 

and in the last expression the numerator is zero, since 
ti = 2a/m and n+} = 4m, so that (n+ })kti, = ka, while 
the denominator is different from zero. If m =— 2n, the term 
CoS kta = cos nkt, occurs in each sum; combination of the 
two, with subtraction of the redundant term, gives 


` sin (n + 4) kt 
D> coskt, = nee “— cos nkt; 
sinnkt, costkt,+cosnktsingkt, een 
= sin kt, Se late 


= sin nkt, cot Het = sin kn cot(ka/m) = 0, 


the assumption being still that X% is not divisible by m. 


8 
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The proposition of which the preceding lines give an 
analytical proof is almost evident geometrically, though 
a precise formulation of the geometrical argument requires 
a little attention to detail. The pairs of numbers (cost, sin £), 
y =0,1,---,m—1, are the coérdinates of the vertices of 
a regular polygon of m sides having its center at the origin. 
and the quantities (1/m) X cost,, (1/m) X sint#,, being the 
coérdinates of the center of gravity of these points. must 
evidently be zero. The more general fact of the vanishing 
of > coskt, and > sinkt, is then obtained by consideration 
of the various possibilities as to the existence of common 
factors of k and m. 

In the following statements, let p and q be integers subject 
to the restrictions 0 < p < 4m, 0<q< 4m. Then, in the 
identities 


cos px cosqxr = — [cos(p—q)x+cos(p+ qa], 

sinpx singa = = [cos (p — q)z — cos (p+ q)a]. 
3 Dp piy 

cospe singz = 5 [sin (p +) — sin (p — q) z], 


p—q and p+q are integers numerically less than m, except 
that p+q = m if m is even and p = q = 4m, so that 


> cos pty cos qty = 0 it p+q, 
> sin pt, singt, = 0 if p+q. 
> cos pty sin gt, = 0 for all p and q, 


> cos? Jir —— sin? pt, = si m if O< p< E m, 
: ; 2 2 


D cos? pt, = M; > sin? pt, = if p= 0 or : m. 


Until the contrary is stated, let it be supposed now that 
m is odd, m = 2n+1. The equations for the interpolating 
coefficients given in the second paragraph of the chapter 
can be solved explicitly, under the present hypothesis of 
equal spacing of the points 4. It is known in advance that 
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they have one and just one solution. If the 2n-+-1 equations 
are added as they stand, the left-hand member in the result 
reduces to (2n-+1)ao, so that 


iP ee 
= ont a 


(As to the convention with regard to the meaning of the 
sign X, it is to be understood here that the summation is 
performed specifically over the range r —0,1,---,2n, or 
else that y» is defined outside this range as a periodic 
function of the index by the prescription that y-+2.+1 = yr.) 
To determine ax, k>0, let the (+ 1)th equation be mul- 
tiplied by cos ktr, r = 0,1, ---, 2m. On addition of the 
equations thus obtained, the left-hand member in the sum 
becomes ar > cos*kt, = mar = 4(2n+1)ax, whence 


9 
ük = antl > Yr COS ktr. 
Similarly, 


2 
be = =——— F ysin kt. 
= ond D> yr sin ktr 


If the notation is modified to the extent of representing the 
constant term in the interpolating sum by ao/2, instead of ao, 
the general formula for a, gives the correct value of dp also. 
It will be understood henceforth that the notation is adjusted 
in this way, and the interpolating sum for equally spaced 
points ż. will be denoted by Sn (x), so that 


i 1 
Sa (x) = 3 dota cosr+ +++ + an cos nx 
+b sin z+ --. +), sin nz, 


the a’s and b’s being given by the two preceding equations 
for k = 0, 1,---, n. The resemblance to the Fourier coeffici- 
ents is apparent. 

Even if it had not been known a priori that the equations 
have a unique solution, that fact would be an immediate 
consequence of the work of the last paragraph. For the 
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work shows that if the equations have a solution, it must be 
the one indicated; that is, they can not have more than one 
solution for any given set of y's. But if the determinant 
of the equations were zero, there would be values of the y’s 
for which there would be infinitely many solutions. This new 
proof of the non-vanishing of the determinant, however, 
depends essentially on the assumption that the s are equi- 
distant, and so is less general than the one previously given. 

The formal resemblance to the case of Fourier series is 
further borne out by substitution of the values of the a’s 
and b’s in the expression defining Sn (x), and rearrangement 
of the result by means of the identity for a sum of cosines: 


ç n F 
Sr(a) = arr È” 5 + È (cosk t, coskx-+ sin kt sinke) 


2 if a 
Da] v| 9 + Seoskit,—m)| 


—_ 1 yg, sn+3)(6—2) 
5 TEEPA sin $ (t — <) ‘ 


Incidentally, the correctness of the last expression as a solution 
of the problem of interpolation can be verified directly, by 
substituting x = t, for any particular value of q. For 
by— tg = 2(r -g)a/(2n+1), sin (n +4) (tt) =sin(r—g)a=0, 
and all the terms of the summation vanish except the single 
one with a vanishing denominator, while the limiting value 
of the quotient of sines in that term is 2n+1, so that 
Sa (ta) = Yq: 

Now let the problem be changed by supposing that m is 
even, m = 2n. Corresponding to the 2” abscissas to, +++, fan—1, 
let 2n numbers yo, --+, Yen—1 be given, and let y, be defined 
for other values of r so that y+, = yr. Consider the 
question of the existence of a trigonometric sum 7',(x), of 
the nth order, represented for the moment by the notation 
of the second paragraph of the chapter, to satisfy the con- 
ditions 7',(t-) = yr, r= 0,1,---,2n—1. It is not to be 
expected that the problem will have a definite solution, since 
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only 2n conditions are imposed on the 2-+1 coefficients. 
The precise nature of the indeterminacy is brought out by 
going through the formal manipulation of the equations. 
Written out at length, these are as follows: 


aot ai costo +hsintfy +--+ arcosnto + basinnty 
— Yo: 
(y+ accost +h,sinf, +--+ a,cosnt, -+h,sinnt 


= ts 


ty + ay COS ton—1 + Dy Sin toni ++» + An COS 1 boy—1 + bn Sin N toy —1 
. . = yma: 
Direct addition of them gives 


2na = >, lb = (> y)/2n. 


Multiplied respectively by coskt, coskt,, -+ +, cosktoy,—1, for 
any value of k from 1 to n—1 inclusive, and then added, 
they yield 


(1/n) > yr cos ktr, 


k 
and similarly 
be = (1/n) > yr sinkt. 


So far there is no ambiguity, and no formal difference from 
the case previously treated, except the replacement of 2n-+1 
by 2n. When the (7+1)th equation is multiplied by cos nt, 
there is a difference because in the present case n = 4m, 
and > cos*nt, is equal to m = 2n, instead of 4m. The 
determination is still perfectly definite, however: 


ln = (Èy cosnt,)/(2n). 


But as 4.=- ra/n, the numbers sinz?, are all zero, and the 
use of these quantities as multipliers does not lead to any 
determination of ba at all. 

It is now apparent how the problem of interpolation with 
an even number of equidistant abscissas can be formulated 
so as to have a determinate solution. If S(x) denotes an 
expression of the form 
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=A a cosa +--+ an cos (n —1) a+ : än COSNT 


+b singz +.--+b,-1 sin (n—1)z, 
the conditions Sn (t) = yy are expressed by 2n linear equations 
for the 2n coefficients. The solution of these equations, if 
they have a solution, is given uniquely by the formulas 


ük = (1/n) > yr cos kt,, be = (1/ n) > Yr sinkt,, 
for k = 0,1,---,m. But if the determinant of the system 
were zero, there would be values of the y's for which more 
than one solution would exist. So the value of the determinant 
is certainly different from zero, and the problem has the 
unique solution indicated. As sinnz?/, — 0 for all values 
of r, the expression 

Sala) + by sinne, 
with any value of ba, likewise takes on the value yy when 
æ = tr; the indeterminacy of the problem as first proposed 
consists merely in the complete indeterminacy of the coef- 
ficient ba. The subsequent discussion will relate to the sum 
Sn(x) as defined at the beginning of this paragraph, with- 
out the term in sin na. 

For deriving a concise expression for S,(a), to be sure, 
it is convenient as a matter of form to include a term 
basin nz, the coefficient ba being defined by the general 
formula for bk, with =n; as each term of the summation 
vanishes, this gives b} = 0 automatically. Then it appears that 


is n—1 
Sn (x) = L Syl- + È cos k(t-—z) + 1 eos nea), 
n 2 k=1 2 
or, as 


; +eosu+ -~ + cos (n—1l)u+ 5 cosnu 


= sin nu cot : u 
Ere 25a 
by subtraction of $ cos nu from the identity previously used, 


Sa (x) = 5 Dyr sin n (t. — x) cot : (t-—2). 
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As in the case of an odd number of points, substitution of 
x = ty gives a direct verification of the validity of the formula 
for purposes of interpolation. The analogy with the partial 
sum of a Fourier series, however, is not superficially so much 
in evidence as before. 


2. Convergence and degree of convergence 
under hypotheses of continuity over entire period 


For the discussion of the analytical properties of the 
interpolating sums Sa (x), particularly questions of convergence 
as n becomes infinite, let the given numbers r be values 
of a function having specified properties: let f(x) be a given 
function of period 27, and let yr = f(t). Then Sp (x) 
coincides in value with f(x) at the points 4,, and the question 
is to what extent Sn (x) furnishes an approximation to f(x) 
at intermediate points. 

In the first place, let f(x) be a bounded function having M 
as an upper bound for its absolute value. Then a corre- 
sponding upper bound can be assigned for |S,(z). Let 
m= 2n+1. Ifa has one of the values tr, |S, (a) | = |S (t) 
< M. If x is not one of the numbers ¢,, let te be that one 
of these numbers which is nearest to z, or one of the two 
nearest, if two are equally near. Then 


7 : 7 
— —— <rctet+ ——. 
2A hk = eT 2n+1 


tr 
Because of the periodicity of the functions concerned, as has 
been noted already, the summation with regard to r in the 
formula representing Sn» (x) can be extended over any 2n +1 
successive values of r, and in particular from R— n to R+ n 
inclusive. Accordingly 


s NN RY. sin(n+ 4) (t-—2x) 
Wol = gn a7 |, x St) sin} (t,—2) 


a DE. Rin sin (n +4) (+-—z2) | 
= 2n+1 r-=R—an| sin $(t,-—2) 
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The absolute value of the last expression in bars never 
exceeds 2n -+ 1, as may be seen from its interpretation as a sum 
of cosines. This observation will be sufficient as far as the 
three terms corresponding to the indices r = R—1, R, R+1 
are concerned. The numbers tp:2—2, trs — T, +++, trin—@ 
are respectively greater than 27/(2n-+-1), 4a/(2n+1), --- 
-++, 2(n—1)a/(2n-+1); the same thing may be said of the 
sequence of numbers |/p-2—a , tr-s—x\, +++, | tr-n—@)3 
and for all the values of r in question, | 4,—a <a, so that 
sin 4(t,—a)| > (1/7) | t—x|. Hence 


"Rtn | sin (n+ 4) (— 2) 
rainl sm4G%—2) 


R—2 R+n . 
= B0n+1) sin (n + $) (t.— x) 
Z 82n+0+ ( Pes 2.) | BGT pea rece 

R—2 R+n 1 
= s s ý T 1) + ERS A r 

R-2 Rin 1 

<8@nti+n( > +S | 

r=R-n r=R+2 tp-— z] 

n—1 u—1 

< 


Sent D+ Ont St = sony tentns | 
j=1, He 


du 
u 


n—t1 
< 42n4+1)+2n+)f < 4(2n+1)+(2n-+1)log(n—l) 


< (2n+1)(4+ log n), 
and 
Sn (x)| < M(4-+-log n). 


The last parenthesis does not exceed a constant multiple of 
log n, for n 22, and |Sa(x)| does not exceed a constant 
multiple of M log n. 

Similar reasoning, with minor differences of detail, applies 
when m = 2n, the sum S(x) then being given by the formula 
of the third paragraph preceding. The conclusion may be 
stated comprehensively for both cases as 

Lemma I. Jf f(x) is a function of period 2 a satisfying the 
condition that 

\f@)| < M 
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for all values of x, and if Sy(x) is the interpolating sum of 
the nth order for f(a) corresponding to the subdivision of 
a period either into 2n-+-1 or into 2n equal parts, then 


Srv) < CM logn 


for n>1. where C is an absolute constant. 

The representation of the constant by the notation used 
in the lemma preceding Theorem IX of Chapter I does not 
imply that the constants are the same, though of course both 
lemmas could be stated together, with a single symbol to 
represent the larger of the two constants. 

For the application of the lemma, it is to be noted that 
St (a) — Sr(x), < MCM logn, which is likewise not greater 
than a constant multiple of M logn, say BM log n; that the 
interpolating expression corresponding to the sum of two given 
functions is the sum of the interpolating expressions con- 
structed for the two functions separately, and the error of 
the sum is the sum of the errors; and that if m = 2n + 1 
the interpolating sum Sn (x) formed for a function /(#) which 
is itself a trigonometric sum Ta (x) of the nth order is iden- 
tical with 7%, (x), since by reason of the interpolating property 
Sua (x) and Th (x) are trigonometric sums of the nth order 
coinciding in value at 2n-+-1 distinct points of a period. 
Taken together with the lemma, these observations yield 

THEOREM I. Jf f(a) is a function of period 27, if Sy x) 
is the interpolating sum of the nth order for f(x) corresponding 
to the subdivision of a period into 2n-+-1 equal parts, n>1, 
and if there exists a trigonometric sum Ty (x), of the nth order, 
such that 


JS (a) — Tn(x)| < En 


Jor all values of x, then, for all values of x, 


| f(a) — Sn (Œ) | < Beplog n, 
where B is an absolute constant. 


When m = 2n, the interpolating sum of the nth order, 
as defined above, for a trigonometric sum Th (x) of the nth order 
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is not in general the same as 7),(x), since the interpolating 
sum does not contain any term in sin nz. But the inter- 
polating sum Sysi(x) of order n-+1, obtained by taking 
m = 2n-+2, is identical with T(x). For the coefficient 
dn+1 in Sa (£), given by the expression 


[1/(n+1)] D Ty (t) cos (n +1) t, 


reduces to zero, since each of the sines and cosines in Ta (x), 
through sin ng and cos nx, is orthogonal to cos (n+ 1)a for 
summation over the finite range in question; and it follows 
that Spsi(z) and T,(x) are trigonometric sums of the nth order 
agreeing in value at 2n +2 distinct points of a period. The 
above statement may therefore be supplemented as follows: 

THEOREM I (continued). If Sn41(x) is the interpolating sum 
of order n+-1 corresponding to the subdivision of a period 
into 2n +2 equal parts, the hypotheses remaining otherwise 
unchanged, then 


S (2) — Sny (£) < Ben log (n+ 1), 


where B has the same value as before. 

As log(n+1)<2 logn for n>1, the right-hand member 
may be replaced by 2 Be, logn. As an alternative. n may 
be replaced by n —1, to give 


| f(a) — Sn) < Be, logn 
for m = 2n. 

Like Theorem IX of Chapter I, the present Theorem I can 
be combined with Theorems I-IV of Chapter I to give a suc- 
cession of more specific results. The cases of an odd number 
and of an eyen number of subdivisions can be covered by 
a single formulation each time, by virtue of the observation that 
1/(n—1) < 2/n and w[27/(n—1)] < o(42/n) < 2% (2 r/n), 
for n> 1. The symbol A is merely a notation for the largest 
of what would be obtained in the first instance as a finite 
number of different constants, and Ap similarly: 

CoroLuary I. Lf 


J (xs) ERENS |t — Tı 
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for all values of x, and xs, 4 being a constant, then 


: AA log: 
f)—&@| << r 
for m=2n+1 and for m= 2n. 
CoroLLARY M. Jf f(x) is continuous with modulus of con- 
tinuity (0), 


S (a) — 8, (z)| < Aw (=) log n 


Jo m=2n+1 and for m= 2n. 

CoroLLARY Ila. The interpolating sum Sp(x) converges 
uniformly to the value f(x), as m becomes infinite through 
odd or even values (or both), if f(a) has a modulus of con- 
tinuity (0) such that limg—, (9) log ò = 0. 

CoroLLARY IM. Jf f(x) has a pth derivative fP (x) such 
that 


fOe) — f(a) < Alas — ar 
for all values of xı and z, 4 being a constant, then 
3 > < Apt log n 
IA) — Sn(a)| < “28 


for m= 2n+1 and for m= 2n. 
CoroLLARY IV. If f(x) has everywhere a continuous 
pth derivative with modulus of continuity (0), 


SF (@) —Sn(z)| < e zo|” T) logn, 


for m = 2n +1 and for m = 2n. 

In each of these statements, the conclusion holds for all 
values of x, and for all values of n = 2; the coefficient A is 
an absolute constant, while Ap depends only on p. 


3. Convergence under hypothesis of continuity 
over part of a period 


The next proposition is analogous to a result obtained in 
the second and third paragraphs of Chapter II: 
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Lemma Il. If f(x) is a function of period 2a, bounded 
and integrable in the sense of Riemann, 


lim (1/m) > f (t,) cos nt, = lim (1/m) > S(t) sinnt, = 0, 


as m becomes infinite through odd values (m = 2n-+1) or 
even values (m = 2n) or both. 

There is a noteworthy difference in the hypothesis imposed 
on f(x); it is in the nature of the case no longer sufficient 
to assume merely that the function is summable, or summable 
with its square, since an enumerable set of very large values 
of f(x), while not affecting the definite integrals of the 
earlier chapter, might throw the present sums entirely out of 
proportion. 

Suppose first that m = 2n+1. Let the summation be 
extended for definiteness from r = 0 to r = 2n. Since 
ty = 2ra/(2n-+1). 


m arne EI S s ae ee a 
A E 2n +1 EA E 
T7 
cosn t, = (— 1)" cos mF’ 


2n 


iat. È f(t) cos nt, 
n “So è ? 
= L 27r Sf. 47% +) eos: 29" 
Qa f= 2n+1 2n +1 2n+1 


1 & 2x (PRI) cog OZ 
On f= 2n Fil 2n+ 1 E 2n+1 ° 


When 2 becomes infinite, the first sum in the last member 
approaches 


= , 
f f(u) cosu du, 


by the definition of integrability, and the second sum has 
the same limit, so that the whole expression approaches zero. 
(Neither sum as it stands is exactly of the form that would 
aa ordinarily be -written down in defining the definite integral, 


TEN 
8 
oe, 
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but the discrepancy in each case amounts to half of a single 
term, and approaches zero in the limit.) As 


7 AAE 

sinint,. = e a= 
r ( ) on +1 , 

similar reasoning applies to the expression 


[1/(Qn+1)] Y) sinnt. 


If m = 2n, the summation going from r = 0 tor = 2n—1, 


b = nN, cosnt, = cosra = (—1)’, 
1 
in DAS cos nt, 
z 1 n=1 TPF La 1 n n pns 
"> 12k Š n s| n Qa È aI n |: 


Each sum in the last member approaches 


fre u) du, 


and the whole expression approaches zero. As for the other 
expression in the statement of the lemma, sin n t, = 0 throughout. 

Suppose f(x) is a function of period 27, bounded and 
integrable in the sense of Riemann, which vanishes for 
so — qL r< roty. It will be seen that 8, (x) converges 
to the value 0, whether the values of m entering into the 
definition of the sums S, (2) are even or odd. In both cases, 
Sn(x) can be represented, artificially but with readily verifiable 
accuracy, by the single formula 


Sa(x) = t SREY sin n (t, — x) cot ; (t — 2) 


1 


Fg ae ati) > f(t) cos n(t-— x). 
As cos n(t;— ax) = cosnax cosnt,-+sinnax sinnt, the last 
sum, taken with the factor 1/(2m), approaches zero uniformly 


for all values of x. And as f(t) cot}(¢— 2) is bounded 
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and integrable in the sense of Riemann as a function of ż, 
the first factor vanishing where the second factor is large, 
application of Lemma II to this function shows that the first 
part of the expression for Sn(x) approaches zero for £ = 7o. 
If two functions, each bounded and integrable in the sense 
of Riemann, are identical from z— 7 to £o +7, the difference 
of their interpolating expressions converges toward zero at £o; 
if f(x) is any function of period 2a, bounded and integrable 
im the sense of Riemann, the convergence of the corresponding 
interpolating sums S,(a) at any specified point, as m becomes 
infinite through odd or even values, depends only on the 
behavior of f(a) in the neighborhood of the point in question. 

Now let f(a) be of period 27, bounded and integrable in the 
sense of Riemann, and identically zero for e—y < x < B+y7. 
and let attention be directed to the problem of showing that 
Sn(#) converges toward zero uniformly for « < x < £. It 
was noted in the last paragraph that S,(#) either is given 
identically by the expression 


L St) sin n (ly —a) cot 4 (e—a), 


or differs from it by an amount which approaches zero uniformly 
for all values of x as n becomes infinite. Furthermore, the 
terms resulting from the expansion of sin x(t,;-— x) may be 
considered separately, and as the factors sin næ and cosna 
are bounded, it is sufficient to demonstrate the uniform con- 
vergence of the expressions 

1 S(t) cos nt, cot 5 (tr — x), 


m 
1 > : 1 
f(t) sin nt, cot —(t,— x), 
m 2 
as x in the argument of the cotangent ranges over the inter- 
val («, 8). 

Suppose in the first place that m = 2n+1. Let x be 
restricted to the interval «<x <£. Let C(x, t) be a function 
which vanishes when |¿— g | differs from an integral multiple 
of 27 by less than 7, and is equal to cot }(t— x) elsewhere. 
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Then f(t) C(x, t) is identical with f(t) cot }(¢—), under the 
restriction imposed on s, while C(x, #4) is bounded for all 
values of x and ¢, never exceeding cot 47 in absolute value, 
and is /-integrable as a function of ¢. By adaptation of 
the formulas of an earlier paragraph, 


[27r/(2n+-1)] > t) cos nt C(x, ty) 
is equal to 


n 


Errar (52) cos 2j r c| djn ) 
S 2n+1° pras  2n+1 \?2n+1 


minus another expression ofsimilar form. Let uj 2ja/(2n-+-1), 
j =0,1,---, n, and let du; = 2a/(2n+1) forj = 0, 1,---, 
n—1, while du, = a/(2n+1). Then the sum just written 
down differs merely by the quantity 

7 4nm _ ana a| énn 
2n-+1 ee i] ES 2n+1 £ (x. Qn+ i) 


which approaches zero uniformly for all values of x, from 


È Su) cos uj C(x, 2u) Auj, 
J= 


a sum corresponding to a subdivision of the interval (0, 7) 
into n+ 1 parts, not all of equal length, and defining the 
definite integral 


ji Jf(2u)cosuC (z, 2u) du 
in the limit. 

The essential point for the present argument is that the 
sum differs from the integral by an amount which approaches 
zero uniformly with respect to 2. Let M; and mj be the 
least upper bound and the greatest lower bound of the 
integrand over the jth sub-interval, P; and pj the corresponding 
bounds for the factor f(2u)cosu, and Qj and gj those for 
the factor C(x, 2u). The difference between the sum and 
the integral does not exceed 


2 (Mj—myj) dw. 
J=0 
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Let M be the least upper bound of /(2u), as u varies 
withoutrestriction, anupper bound consequently for /(2u) cosw), 
and Q(= cot 4y) the least upper bound of C(x, 2u). Then 


Mj— mj < M(Q—G)+QPi—D), 
since for any u and w of the sub-interval in question 


f(2 u2) cos us C(x, 2us)— f (2u) cos u, C(x, 2u) 
= f(2uy) cos us [C (x, 2u)— C(e, 2%)] 
+ C(x, 2u) [£2 ug) cos us —f (2m) cos tu]. 


As f(u) is by hypothesis integrable in the sense of Riemann, 
f(2u) cosu is R-integrable also. Let € be an arbitrary 
positive quantity. By the property of integrability (more 
specifically, by Darboux’s theorem) it is possible to choose 
so large, and the intervals Awu; in consequence uniformly so 
small, that 


n n 
te E 
È (Pr) Au< 2Q° 2,0 —p) Ay < 3° 
On the other hand, 


p> (Qi — 4i) 


can not exceed the total variation of C(x, 2u) as u ranges 
over the interval (0, æ), a total variation which is finite and 
independent of x, being equal to 4Q. Hence if n is taken 
so large that 27/(2n-+1)<«/(8QM), it follows that 


é 


2 M(Q;— qj) 4uyj< oe 
j=0 4 


So there exists an NV, independent of x, such that for n 2 N 


n 
> (M—m)åu <e. 
j=o 
The other sum analogous to the one just treated is 
S). 22 aj Uz) (27—1)x 
a s 4 Cla, — ~ 
D nT" | Intl | Qnti N In+1 
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By addition of the term 


2n+1 


a ia Ate ,@n—a f: 2@n—1)x) 
2n+1° 2n+1 SRST ES ee 


which approaches zero uniformly, this becomes a sum ap- 
proximating the same definite integral as before, and corre- 
sponding to a subdivision of the interval of integration into 
n—1 parts each of length 2a/(2%-+1) and one part of 
length 32/(2n-+-1). The difference between the sum and 
the integral approaches zero uniformly, and the difference 
between the two sums does the same, which means that 


—— > f (tr) cos nt, cot $ (t, — x) 
an á 


converges uniformly toward zero. The corresponding ex- 
pression with sinn?, in place of cos nt, can be treated in 
the same way. This completes the proof of uniform con- 
vergence of S, (x) for odd values of n. 

For m = 2n, the discussion of the sum containing cos n ty 
is slightly simplified in form by the fact that cos nt, reduces 
to (—1)", and by the fact that 2a/m is an exact submultiple 
of a, but otherwise follows essentially the same lines as 
before. The sum with sin x?¢, vanishes identically for all 
values of n, since sin n tr 0. 

The conelusion is embodied in 

Lenna MI. Jf f(x) isa function of period 2a which is 
bounded and integrable in the sense of Riemann, and which 
vanishes identically for @— < x < B+-9, the corresponding 
interpolating sums Sa (x) converge uniformly toward zero for 
«a< x< l, as m becomes infinite through odd or even values, 
or both. 

On the formal side it may be pointed out in passing, and 
might have been noted before, that for m even S), (7) has 
the alternative representation 


1 ee 1 
Se sin n LÒ flt) cos nt, cot = (t-— x), 
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though the appearance of sinng as a factor of the whole 
expression is illusory to the extent that corresponding to 
each value of x for which sinna = 0 there is a factor 
cot $(4.— x) which becomes infinite. 

Lemma III may be combined at once with Corollary Ila 
of Theorem I to give 

Tutoren M. Jf f(x) is a function of period 2a, bounded 
and integrable in the sense of Riemann over a period, and 
continuous for @—y < xr < B+y with a modulus of con- 
tinuity (6) such that limy—, ol) log Ò = 0, the corresponding 
interpolating sums S,(ax) converge uniformly toward f(x) for 
a< x<, asm becomes infinite through odd or even values, 
or both, 


4. Convergence under hypothesis of limited variation 


There are theorems of convergence for functions of limited 
variation, analogous to those obtained in the case of Fourier 
series. Let f(x) be a function of period 2a, with limited 
variation over a period, and let x have a value, to be regarded 
for the time being as fixed, such that /(f) is continuous for 
t=. It will be shown that S,(z) converges toward f(s) 
for the value of r specified. 

Let p, (t) and pə(t) be the positive and negative variations 
of f(t), measured from the point t = — 27, say, so that 


S(t) = TA es 2a)+% (t) — pe (t); 


while g, and gg are monotone increasing, and continuous for 
t= xz. (The point s may of course be thought of without 
loss of generality as belonging to the interval (0,27), and 
then there will be no occasion to take account of values of 
t less than —27.) Consider the case m=2n-+1. If 
is one of the numbers ¢,, for a specified value of n, Sn (£) = f) 
exactly. Otherwise, let tr be the number ¢, which is nearest 
to x (or one of the two such numbers, if two are equally 


near), so that 
T 


T à 7 
tp — ——__ << 2 < trt = - 
č 2n +1 = = ont 
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Let the summation over r in the formula representing Sn (x) 
be extended from R —n to R-+-n. The corresponding formula 
with f(t) replaced by 1 represents 1 identically, since 1 is 
a trigonometric sum of order zero. If this identity is multi- 
plied by the quantity f(x), independent of r, and the result sub- 
tracted from the formula for Sn (a), the difference Sn (a) — f (£) 
is obtained in the form 


Sale) — fa) = 


R+n ° 
F 5 TTE sin(n-+-$)(t,— x) 
Q2n+1 r Rn LE) — ro) sin $ (t — x) 


The right-hand member is equal to the difference between 


i R4n l „a Siin H 4) (4 — x) 
2 A [v1 (t) — p1 (a)] = sin $(t-— 2) 


2n-+-1 +R 


and the similar expression with @s(¢-)— s(x) in place of 
tı (tr) — fı (x). 

The quotient of sines never exceeds 2n-+1 in absolute 
value. In the term corresponding to the index r = R, tr 
approaches s as n becomes infinite, and 9, (tr) approaches 
yı (x), by the continuity of y,, and the whole term, multi- 
plied by 1/(2n + 1), approaches zero and need not be further 
taken into account; it is sufficient to consider the sum from 
R-+1 to R-+n and the corresponding sum from R— n 
to #—1. 

Since trij — x = tr—a#+[2ja/(2n+1)]. 


sin (n+ z) (irj = 2) = sin| (n+ >) (tr— x) +ia| 


= (—1) sin (n+ >| (tr— 3), 


and is independent of j, except as to algebraic sign. If 
4(trij—x) is denoted by wj, and if ¢ stands for 0 or 
according as sin (n + })(tr— x) is positive or negative, 


— 


1 R+n adi (n $ y Ke 2) 
Fatt Sn Vo) E 


= IF È (—1) A; B;, 
£ 
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where 
sin (n + 3)(tr— x) 
(2n-+ 1) sin uj 
The numbers Aj, B; are all positive or zero. As 7 increases, 
A; increases or remains unchanged; wuj is always between 0 
and 7/2, sin uj increases, and Bj decreases. Let V be the 
total variation of f(a) over an interval of length 2a. Then 
the A’s have the upper bound 3 V, since the positive and 
negative variations of a periodic function over a period are 
each equal to half the total variation; while B;, which is 
equal to the absolute value of 


sin (n +4) (tri j— x) 
(2n + 1) sind (trij— £) i 
can not exceed unity. 

After the analogy of the proof in the case of Fourier series. 
where the second law of the mean was employed, it would 
be natural here to use the method of summation by parts. 
The formulation can be slightly simplified, however, by reason 
of the fact that the sequences Aj, B; are both monotone, 
though varying in opposite senses. In general, let a,,---, ap. 
Di, +++, bp be two sets of p numbers each, satisfying the 
conditions 

4 = tgZ++-=> t= 0, by 2 be 2 +++ > In = O. 


Aj = pm (2 +2 w) — gp (2), By = 


SALUS loesch. 


Then 
— > (—1¥ aj Dp—j+4 = & by == is bp + ad 


j=1 
= u (bs — cp) — a2 (bi — pa) + --- 


= b (ay — dg + ---) — (ay Cp — a: Cp + +++). 


Since both the a's and the c’s are non-negative and decrease 
monotonically, the value of each parenthesis in the last 
expression is positive or zero, and not greater than its leading 
term, in one case a, and in the other case a, ¢p, which in turn 
is not greater than a, b. So the whole expression is the 
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difference of two non-negative quantities, neither of which 
can exceed a, b, and is itself not greater than a, >, in ab- 
solute value. To emphasize by words rather than by subscripts 
what is essential in the conclusion, the absolute value of 


$ (— 1 y aj Dp—j4 1 
J=1 


does not exceed the product of the largest of the ws by the 
largest of the b's. 

Let e be an arbitrary positive quantity. Corresponding to 
the continuity of y,, let Ò > 0 be chosen so that | p, (t)— pı (a)! 
< fe for |\t—a <ô. Let the sum X (—1) A; B; of the 
second paragraph preceding be broken up into two, >” and 
>”, in the first of which ïj ranges over the values for which 
2uj< ð, while the remaining terms make up the second. 
In ©’, the largest value of A; is not greater than 4€, while 
Bj never exceeds 1, so that by the conclusion of the last para- 
graph, with suitable adaptation of the subscripts,’ < }«. 
In $”, Aj < $V throughout, while B; < 1/[(2n +1) sin 4 ð]; 


consequently 
>") < v|[2en+ nsn} o], 


which is less than }¢ as soon as » is sufficiently large. 
Hence X'+”, or, in the earlier notation, the sum from 
R-+1 to R+n, multiplied by 1/2n-+1). approaches zero 
as n becomes infinite. The sum from R— n to R—1 and 
the corresponding sums with gs in place of p, can be treated 
in the same way, to show that S,(#)—/(x) converges 
toward zero. 

If f(x) is continuous everywhere, the uniform continuity 
of p, and ps, bearing on the choice of ð in the preceding 
paragraph, and, as a detail, on the convergence toward zero 
of the single term in each sum for which r = R, yields at 
once uniform convergence of Sy (x) toward f(s). 

Finally, a precisely similar argument can be carried through 
for the case of an even number of interpolating points, 
m = 2n. 
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The conclusion is 

THEOREM II. Jf f(x) is a function of period 2m having 
limited variation over a period, Sn (a) converges toward f(x) 
at every point where f(x) is continuous, when m becomes in- 
finite through odd or even values, or both; if f(x), still sup- 
posed to be of limited variation, is continuous everywhere, the 
convergence is uniform for all values of x. 

This result can be generalized immediately by reference 
to Lemma II, and the more elementary fact that convergence 
at any point depends only on the behavior of the function in the 
neighborhood of the point: 

COROLLARY. If f(x) is a function of period 2a, bounded 
and integrable in the sense of Riemann over a period, of 
limited variation for %—y < x < a+, and continuous 
for x = x», Sn (Xo) converges toward f(a): if f(x) is con- 
tinuous and of limited variation for œ — ny S< x < +y, 
the hypothesis remaining otherwise unchanged, Sn (a) converges 
toward f(x) uniformly for @ < xr < B. 


5. Degree of convergence under hypotheses 
involving limited variation 
Let f(x) once more be of limited variation over a period, 
with total variation V. Let ọ,(x) be the positive variation 
of f(x) from 0 to s when x>0, and minus the positive 
variation from « to O when s<0, and let pə(x) be the 
correspondingly defined negative variation function, so that 


F(x) = FO) + 91 (x) — p: (x), pı (0) = p: (0) = 0. 


Let m = 2n +1, n>0. Consider the sum > /(t,) cos ntr, 
extended specifically from r = —n to r = n. The sum 
X /(0) cos nt, vanishes. In X y(t) cos nt,, the term corre- 
sponding to r = 0 vanishes with p, (0). It has already been 
observed that cos nt, = (—1)’cos[ra/(2n+1)]. In the 
sum 

D> 9: (tr) Cos ty, 


r=1 
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the factor p(t) increases monotonically, having the upper 
bound y, (æ), while, as 77/(2m +1) remains within the limits 
(0, 7/2), \cosnt,| decreases monotonically, having the upper 
bound 1. Consequently, by the reasoning of an earlier 
paragraph, 


I 
| È pi (b) cos ate) < p (7). 
|r=1 
Similarly, the absolute value of the sum from — n to — 1 


does not exceed |pı(— 7). So the whole sum from — n 
to +n does not exceed 


i i 
p (70) + gı (— 7) = pı (m) — pı (— 7) = 5 y. 


After analogous reasoning with pẹ in place of pı, it is con- 
cluded that 


Ds lteosnt,| < V. 


There is a corresponding inequality 


| > S(t) sin nt, 


A 


V, 


the proof being simplified in this case by the fact that the 
monotone sequences Pilt). (pelt), (sinnt. vary in the 
same sense. 

For m = 2n a still simpler calculation, based on the fact 
that cos nt, = (— 1)", shows that | Xf (t,) cos nt| < V, while 
D flte sinnt, = 0. 

Disregarding the special simplicity of the last observation, 
it is possible to state comprehensively 

Lemma IV. Jf f(x) is a function of period 27, with limited 
variation over a period, its total variation being V, then 


> f(t) cos nty, < F, > F (tr) sinnt, < F, 
whether m is odd (m = 2n +1) or even (m = 2n). 


Let f(x) be of limited variation over a period, with total 
variation V, and identically zero for «œ —nņ < x < +y. 
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It has already been pointed out in substance that if em is 
a symbol denoting 1 or O according as m is odd or even, 
Sn (x) satisfies the identity 


: 1 3 
mS, (a) = cos nu > Sf (tr) cot 5 (t; — x) sin nt, 
. ~ neg 1 
—sin nr, S(t) cot = (tr— x) cos nt, 


t 


+ em COS naz > f (tr) cosnt, + em sin nx > f(t) sin 7 tr. 


For any value of x in (@, 8), the product f(t) cot $(¢— <x) 
is of limited variation, regarded as a function of ¢, and its 
total variation does not exceed the product of V by a quantity 
depending only on y. So the application of Lemma IV to 
each of the four sums in turn leads to the 

COROLLARY. If f(x) is a function of period 2a with limited 
variation, the total variation over a period being V, and if 
J (x) vanishes identically for a — ny < x < B+y, then 


Sa (x)| < C,V/n 


Jor a< x < B, where ( ’, depends only on 4. 

This Corollary in turn may be associated with Corollary H 
of Theorem I, in the light of the discussion leading up to 
Theorem Va in Chapter II, to yield 

THEOREM IV. Jf the function f(x), of period 2a, is con- 
tinuous with modulus of continuity œ (0) for @—y <£ < B+, 
where w(d)>0 for d>0, and of limited variation over the 
rest of a period, then 


J (2) — Sn (v)) < ce (2a/n) log n 


for @ <r = RB, ifn is large enough so that w(2a/n) has 
a meaning, c being a constant which depends neither on x 
nor M n. 

Lemma IV is analogous to Theorem IH of Chapter H, the 
sums in the present lemma, multiplied by a quantity of the 
order of 1/n to represent the length of the interval between 
successive points ¢,, corresponding to the integrals in the 
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earlier formulation, though as the length of the interval is 
either exactly or approximately a/n, according as m is even 
or odd, the agreement does not extend to the numerical values 
obtained for the constants in the right-hand members of the 
inequalities. It is sufficiently noteworthy, however, that the 
agreement should be as close at it is, since for any particular 
value of n the number of points used is not large enough 
to make it all evident a priori that the sums give close 
approximations to the values of the integrals, in view of the 
presence of the factors cos na, sin naz under the sign of 
integration. 

The analogy can be extended to the conclusions of Cor- 
ollary I of the theorem referred to. The proof is perhaps 
most readily given by means of the Corollary itself. Suppose 
f(x) has a pth derivative with limited variation. Then f(x) 
is represented by a convergent Fourier series; if the Fourier 
coefficients are denoted by €k, Bx, 


fo=< + È (ax cos kæ + Be sin ka), 
æ =1 


The series may be used to represent /(/,) in evaluating the 
sums X f(t) cosnt,, > f(t) sin nt,. Let m = 2n+1. The 
expression X cos kt, cos nt, vanishes unless k— n or k+ n 
is an integral multiple of 2n +1, that is, unless & has one 
of the values n, n+1, 3n+1, 3n+2, 5n+2, 5n-+3,---. 
The value of the sum in each of these cases is (2 n+ 1)/2. 
The sum Y sinkt, cos nt, is zero in all cases. Consequently 


D> ft) cos ntr = [2 n4 1)/2] (antri Hasni tH asn 
Similarly, 

D F(t) sin ntp= [(2n+1)/2] Bn—Bnsr+ anpi onat): 
But by the Corollary cited, 


|ær| < Viak? t), Be | < V/(ak?+), 


if V is the total variation of f” (s). So 
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| n+ @ntat asni + Czn42 +. 


J 1 1 1 1 
< = - agers —— +... 
E E tapi ' Bnp t Bnp ¢ | 


WIA 1 1 1 | 
A + - -- — a +... 
= Fez net © (Bn)P (3 nj? | 


2V 1 1 
= [1+ 3Pt1 af 5P 


ezi N: 
the series in the last pair of brackets being convergent for 
p = 1, and having incidentally, for p > 1, the upper bound 
1 + 1/3*+-1/5*+ -.., independent of p. A similar inequality 
holds for the series of 8's. If m = 2n. 


D> f(t) cos nt, = 2n (ant Gan Asn + see), 


and X f(t,) sinnt, = 0. Whether m is odd or even, there- 
fore, 


| D> S (ty) cos nt, | < CV/n?, > f(t) sin ni| < CV/n?, 


where C is an absolute constant. 

This result, supplemented by a type of argument which 
has been used a number of times already, gives at once 
certain information with regard to the degree of convergence 
of interpolating sums S, (x). For example, if f(x) has a first 
derivative which is of limited variation over a period, and 
if f(x) vanishes identically for æ — q < r < +y, then 
Sa (x)| can not exceed a constant multiple of 1/n* for «e <s < 2; 
if f(x) has a first derivative which is of limited variation 
over a period, and if f(x) is continuous with modulus of 
continuity o(ð) for «—y < x< 8+4, but not identically 
constant over the interval, then | /(#)—S),,(@) can not exceed 
a constant multiple of (1/n) o(2 z/n) logn for æ < x < 2. 
But such observations are of secondary interest; and it is not 
possible to pass over immediately to a proposition analogous 
to Theorem IV of Chapter II, for the transition from a specified 
interpolating sum Sy (x) to one of higher order does not con- 
sist merely in the inclusion of additional terms directly subject 
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to the inequalities that have been under discussion. For the 
problem thus suggested a different procedure is required. The 
detailed treatment will be limited to the next case in order, 
that of a function having a first derivative of limited variation 
over a period (or expressible as the integral of a function 
of limited variation), and not otherwise restricted. 

The key to the discussion of this case is the fact that if 
f(x) is the integral of a function of limited variation over 
an interval a < x < b, the quotient [/(~) —f(a@]/(a — a) 
(defined in any way, by the value f'(a +) or otherwise, for 
a = a) is likewise of limited variation over the interval. Let 


fe) = fa +f seda, 


where g(x) = 9, (x) — p(x), and p, and pẹ are bounded 
non-decreasing functions from a to b. Let 


s 1 x 
Oiz) = EA f gy (x) dæ 


for a<a<b, while ®, (a) = ¢, (a), and let a function ®, (7) 
be similarly defined in terms of pə(x). Then, for «> «a, 
LOI) _ 0. (z)— 0, (2). 
a a 

The assertion with regard to the quotient on the left will 
be proved if it is shown that @,(x) and @,(z) are non- 
decreasing; the fact is rather obvious, and in formulating 
a proof it is clearly sufficient to consider one of the two 
functions. For any value of «>a, 


(x — a) pı (a) < f yi (x) dx < (x — a) pı (2), 


by reason of the monotone character of ,, so that 
D, (x) = pı (a) = O, (a). Let a, and a, be any two values 
such that a< xı <x <b. Then 


J. gi(x) dx = (a— a) X,, 
“Wy Ñ Ti Ly 
f Pı (a) da =| Nik = (ay —— a) Ni l- (Zi — Tı) Xs . 
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where 
g(a) < X < yy (ry) <M”, (a). 


No 
A pı (x) da > (a,— a) X, + (az — 2) Xı = (a2 — a) X, 


whence ®, (x) > X,; as X, = @, (a), the desired conclusion 
is established. Similarly, [f(a)—/())]/(b— x) is of limited 
variation. 

Now let f(x) be of period 27, and (for simplicity of 
statement) provided with a first derivative of limited variation 
over a period. The ‘function (x— a) cot }(a—a), defined 
so as to be continuous for s = a, has a continuous derivative 


` for a <2<a+a, and so is of limited variation over this 


interval. Hence the product 
E —f(@] /(a@—a)} \(e—a) cot s (w—a)} 
= [f(0)—f(a)] cot 5 (@—a) 


is of limited variation over (a, a+ 7), since this is true of each of 
the expressions in braces. Similarly, (2—a—2z) cot }(a—a) 
is of limited variation for a+ <2<a-+-27; asf(a+20)=/(a), 
the expression [/(x)—/(a)]/(a+2a—.2) is the same as 
[/(@)—f (a+ 22)]/(a4+-2a— x), which is of limited variation 
over (a+, a+2); and so [f(x)—/f(a)] cot }(a—a) is of 
limited variation over this interval also. In summary, and in 
slightly different notation, the expression [/(¢)./(z)] cot} (t-s), 
regarded as a function of ż, is of limited variation over any 
interval of length 27. 

Let the interpolating sum for f(x) be expressed once more 
in the form used in connection with the Corollary of Lemma IV. 
The interpolating sum for a constant reproduces the constant 
identically; since f(z) is a constant with respect to the index 
of summation 7, the quantity given by the formula reduces 
identically to m f(x), if f(t) is replaced by /(~) under the 
sign of summation, and 


www.rcin.org.pl 


IV. TRIGONOMETRIC INTERPOLATION 141 


m [Sn (x) —f(x)] 
= cosn > [L/(t,)— f(a] cot (t,— x) sin nt, 


—sinna X (f(t) —f(@)] cot = (t,— x) cos nt, 


+em cos nx X [f(t,) —f(a)] cos nt, 
+em sinnae X [StS] sin n t. 


But f(— f), as well as [f()—/f(@)] cot $(t—2), is of 
limited variation with respect to t, and consequently Lemma IV 
is applicable to each of the sums on the right. Closer 
inspection shows that the total variation in each case is not 
greater than a constant multiple of the total variation of 
JS'(z) over a period. If this fact is incorporated in the 
statement of the result, the conclusion may be formulated as 

THEOREM V. If f(x) is a function of period 27 having a first 
derivative with limited variation, the total variation of f (x) 
over a period being V, then 

S(x)—Sn(z) < CV/n, 
where C is an absolute constant. 

The corresponding analysis for the case of functions having 
higher derivatives will not be carried through here. It may 
be pointed out, however, as a first stage in the extension, 
that if f(x) has a continuous non-decreasing second derivative 
for a < x < b, then [f(2) —f(a@)]/(a—a), considered to have 
the value f'(a) for x = a, has a non-decreasing first derivative 
over the same interval. For x= a, 

+ Pah =- Ew, 
as may be seen by calculating the derivative from first 
principles and applying the extended mean value theorem in 
the process. For «>a, 
A fa — r T a) f (x) SMEA] 
= 


x—a (a — a)? 


rO z "o, 
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the number § lying between a and x. For s> a, furthermore, 


de | fæ- _ eof" aar aAA] 
dz*| x—a e (x — a)? 
EA y 
T— Q 


The monotone character of the first derivative thus becomes 
apparent. It follows that if f(x) has a continuous second 
derivative of limited variation, [/(@)— /(a)|/(a— a) has 
a first derivative of limited variation, and connection can be 
made with the facts previously ascertained as to functions 
satisfying the latter condition. 


6. Formula of interpolation analogous 
to the Fejér mean 


A considerable part of Chapter II was devoted to a dis- 
cussion of the arithmetic mean of the partial sums of the 
Fourier series. There is a corresponding formula in the case 
of interpolation, possessing many analogous properties, with 
the outstanding exception that it is not an arithmetic mean 
of a sequence of the interpolating sums previously studied. 
It is to be defined and examined on its own merits, with 
only incidental reference to the content of the earlier part of 
the present chapter. 

Let n be an arbitrary positive integer, and let t = 2r z/n. 
for any integral value of r. In comparison with the earlier 
notation, m is now to be taken equal to n, instead of 2n -+1 
or 2n; the sign > will be understood to refer to summation 
over n successive values of the index +, when there is no 
indication to the contrary; a separate symbol m is no longer 
needed; and there is no occasion to distinguish between odd 
and even values of n. 

Let f(x) be an arbitrary function of period 27. The inter- 
polating formula in question is 


S sin? $n (tr — x 
On (x) = LS (t) AS preset 
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with the understanding that each term is defined so as to be 
continuous wherever its denominator vanishes. For x = t4, 
one term of the sum reduces to /(¢,), namely that in which 
r = q, or in which v differs from g by an integral multiple of n, 
while each of the other terms becomes zero, and consequently 


Gn (te) = J (Ee). 


So the interpolating property is apparent at the outset. 
It is an almost immediate consequence of identities previously 


employed that 
(sin 5 n v) / (sin? 3 v) 


1 
= JE n(n cosv+(n—2)eos2v -+-+ +eos(n—)r|. 
Hence, as a result of the substitution v = t, — z, 


On (2) = = (ty + dy COS -+ tg COS2a+ «++ + an 008(n — 1) 
by sing +d, xin Qe... 43,4 ain (n—1) ar, 
with the coefficients 
2) —k 


Ke = ) D f(t) cos k tr, 


2 a> 


bk = =% a: (t,) sinkt. 


The expression is a trigonometric sum, but of order about 
twice as high as would be required merely for the purpose 
of obtaining coincidence at n points. 

If f(x) = 1, all the coefficients reduce to zero except ao. 
while $a = 1, so that in this case o,(x) Dy am T 
(sin? nu)/(sin*x) is denoted by ®, (x), 


pis epa = D (inho |; (t, —»)| f, 


n? sin 


Hence it follows further, inasmuch as ®, (u) is never negative, 
that if f(x) is any function having M as an upper bound 
for its absolute value, 


www.rcin.org. pl 


144 THE THEORY OF APPROXIMATION 


Gyula) = > (1/n*) f (tr) OD, $ (,—2)| | 
< > (M: n?) o,| | (,—2)| = M, 


Like the arithmetic mean associated with the Fourier series, 
the present o,(x) converges uniformly toward f(s), as 
n becomes infinite, if f(x) is everywhere continuous, and 
converges at points of continuity under more general hypotheses 
as to the behavior of the function elsewhere. Inasmuch as 
the sums depend on the values of the function at isolated 
points, however, it is necessary to impose some restriction 
on the values which it may take on point by point, not 
merely to require that it be summable, or in other words 
that it have a finite mean value. 

Let f(t) be continuous for t= v, and let f(t) < M 
everywhere. By a device already used on a number of 
occasions, the error of on(x) can be expressed in the form 


o, (2) —f (x) = a (1/n®) L/ (ty) — f (x) Da [ ; (t.— a| K 


Let e be an arbitrary positive quantity, and let ð be a positive 
number such that | /(#)—/ (a) <4« for t—a2 <ð. Let the 
sum in the right-hand member of the identity for n(x) — f (£) 
be written in the form >’+ >”, where D’ denotes summation 
over those values of r for which ¢,—. differs from an integral 
multiple of 27 by less than ð, and X” stands for a summation 
covering the remaining values of r. mX’, f(t)—f (x) <4e, 
so that 


Dian f)—L | z (te »| 


< >" (1/n®) (€/2) JE ,—2)| 


< > (1/n®) (e/2) On É —) = : e. 


In D”, ®, [3 (t-—z)] < 1/(sin® $6), and F(t) —f a) < 2M, 
while the number of terms can not exceed n, and consequently 


www.rcin.org.pl 


IV. TRIGONOMETRIC INTERPOLATION 145 


et /n®)[ f(t-)—f (a)] Dn | L (,—2)| l< 2M/(n sin? > ò) : 


which is less than }¢ as soon as n is sufficiently large. This 
establishes the fact of convergence. It is sufficient that » 
surpass a bound depending only on M and ð; if f(x) is con- 
tinuous everywhere, ð can be chosen independently of r, 
and the convergence is uniform. If f(x) is continuous for 
a— y < r < +y, without being necessarily continuous 
everywhere, it is possible to choose a ò<» which shall be 
valid for all values of x in the interval « < x < 8, and 
the convergence is uniform over the latter interval. Finally, 
if f(x), instead of being merely continuous for æ — y < «x 
< 8+ 4, is identically zero there, and if x is given a value 
belonging to the interval («, 8), Dn [b(t — x)] < 1/(sin® $y) 
in all terms in which f(t) + 0, and 


Gn (x) < M/(n sin" }y). 


The results may be summarized in 

THEOREM VI. Jf f(x) is a bounded function of period 27, 
G» (x) converges toward f(x) at every point at which f(x) is 
continuous. If f(x) is continuous everywhere, the convergence 
is uniform everywhere. If f(a) is continuous for @— ù < x 
< B+, the convergence is uniform for «<a < 8. If 
f(x) is identically zero for @—y Sax < B+y, næ) 
< C,M/n for @ < x < B, where M is an upper bound for 
J(=), and C, is a constant depending only on y. 

Further discussion of the degree of convergence of On (x) 
will be limited to the case in which /(~) satisfies the condition 


f) —f(@s) S Alar a4). 
Suppose first that this condition is satisfied everywhere. Then 


Sn (x) — f(x) < (4/n*) b tr — 2 o| (t, — »)|. 


For any particular value of x, let tę be that one of the 
numbers ¢, which is nearest to x, or one of the two nearest, 
if x is equally distant from two of them. so that tg — (7/n) 
< x < tr+(a/n). Let the summation be thought of as 


10 
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extended specifically over the » values of + for which s — a 
< t< «+a. The precise expressions for the extreme values 
of r in terms of R and » will vary according to circumstances; 
they will be approximately R+-}n, and it is sufficient for 
the purposes of the present argument to note that they will 
certainly be between /?—n and R-+-n, for any n>1. 
For any value of v, 
D, (5 e) e| z n+(n—1)eose+(i—2) COs 20+ .---+eosin— 1 | 


- 


<f n+(n—1)+(n—2)+--- +1] = hh 


and hence 
: 1/2 
sin nv l ‘ile 
—— | = | ®, | SUN Ss 
sin $r 2 


Furthermore, 0 < v/singjv <a for v < a. Throughout 
this interval, consequently 


Rahs v singar ad 
v Dy, |- v) = ——— - | ———_|-|sin—nv| < mnt, 
é | 2 sindy singe 2 a 
and at the same time (for v + 0) 
1 1 v? ETN | 1 
v D v) = —- >: sin” nv < —- m]: 
i | 2) v| sin’ tv 2 = j0 


These relations are to be used in connection with the 
inequality for o,(2)—/(x) in the second paragraph preceding. 
If x is one of the numbers ¢,-. 6, (a) = f(x), and there is 
no further question as to the magnitude of the error. If « 
does not coincide with a ¢,, there are just two values of r 
in the summation for which 4 — r <22/n. In the terms 
of the sum corresponding to these values of 7 it is sufficient 
to apply the relation v ®, (40) < na, or h— e ®,[4(4-—2)] 
< na. For the other points ¢,, in the order of increasing 
distance from x on each side, the values of #.—sa are 
successively greater than the numbers 27/n, 44/n, 620/n, +++, 
and by the relation v ®,(4v) < a° v the corresponding 
values of t;-—ax ®,[4(t,—«)] are respectively less than 
nm/2, nag, nav/6, +++, Hence, as the sum involves not 
more than » points 4, on each side of r, 
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= 1 
> lira o,| 5 t — a)| 
< Ina +2 [na/2 4na 4na + -+ nall n] 


UF 1 
= na|2+1+4+ ate + z 


< nala+f a] = na (3 +log n), 


~ 


which for n > 2 does not exceed a constant multiple of 
nw logn. This makes it possible to state 
THEOREM VII. Jf f(x) is a function of period 2a satisfying 
everywhere the condition 
fer) —f(@,)\ < Am—ay, 
then, for all values of n> 2, 
F) — 6, (2) < (Clog n)/n, 
where C is an absolute constant. 
The conclusion may be generalized at once by combination 
with the last assertion of Theorem VI, to yield the 
COROLLARY. If f(x) is a bounded function of period 2a 
satisfying the condition 
J (a) — f(a) < Alem 2, 


throughout the interval @—y<a2<— B+ 4, then 
T(x) — onla) < (eå log n)/n 


Jor «Sx < Rĝ, where c is a constant depending neither on x 
nor on n. 

In conclusion, it may be pointed out as a peculiarity of 
the interpolating sum o» (x) that ¿ts derivative vanishes at each 
of the points ty (See L. Fejér, Göttinger Nachrichten (1916), 
pp. 66-91, especially pp. 87-91). For in the identity 


On (7) = j% (1/n*) FC) Dn | 5 (t — »| 


it is evident from the expression of ®, in fractional form, 
or can be verified by differentiating this expression, that 
10* 
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@,,[4(t,— x)] has a double root for « = ¢, it t» is not con- 
gruent to ¢, modulo 27, while the representation of On [} (t —x)] 
as a sum of cosines shows that it also has a vanishing 
derivative for s = ty. 


7. Polynomial interpolation 

As was indicated in the opening paragraph of the chapter, 
the methods that have been set forth are not adapted to the 
study of the problem of polynomial interpolation with equally 
spaced points. That problem is analogous rather to the theory 
of Taylor's series, whether treated by means of Taylor's 
theorem with the remainder for real variables, or by Cauchy’s 
theorem in the complex plane. A simple change of variable, 
however, serves to carry over the formulas of trigonometric 
interpolation to a case of polynomial interpolation with 
unequally spaced points distributed in a certain way. Suppose 
namely that a function / (2) is defined for —1 < x < 1. Then 
f (cos 6) is a function defined for all values of Ø. It is an 
even function of @, and inspection of the formulas defining 
the coefficients in the interpolating sums S),,(4) and on (4) shows 
at once that these sums involve only cosines, and so may 
be regarded as polynomials in cos 6. If x= cos 8, S,(@) 
and on (8) may be denoted by P, (a) and a, (x) respectively. 
They are polynomials agreeing in value with f(x) for a set 
of values of æ corresponding to equally spaced values of 4. 
The polynomial 7, (x), unlike the interpolating polynomial of 
minimum degree for equally spaced values of x, converges 
in the case of every continuous function f(x); its degree how- 
ever is approximately twice as great as the number of points 
for which coincidence is obtained. The expression Pha (x), on 
the other hand, is an interpolating polynomial of minimum 
degree, and while it is not convergent for every continuous 
function, it converges far more generally than the correspond- 
ing polynomial with equally spaced points. It is unnecessary 
to enumerate the further theorems on convergence and degree 
of convergence which would be obtained by following out 
the transformation of variable in detail. 
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CHAPTER V 


INTRODUCTION TO THE GEOMETRY OF FUNCTION SPACE 


1. The notions of distance and orthogonality 


In Chapter MI, attention was directed to the problem of 
the approximate representation of a given function by means 
of linear combinations of other given functions, according to 
the criterion of least squares. If f(x) is a given function 
over an interval (a, b), and if p (£), po(x), ---, pm(~) are 
m linearly independent functions over the same interval, the 
coefficients c1, «++, Cm in an expression 


g (x) = cy pr (2) + ce po (£) +++ ++ Em Pm (x) 


are to be determined so that 


froo 


shall be a minimum. The value of the integral is taken as 
a measure of the discrepancy between the functions f(s) 
and ¢ (x). 

The problem is the same in principle as that of the method 
of least squares for the approximate solution of a set of linear 
equations. Suppose there are n equations in m unknowns, 
n>m: 

tri Ti F ay Ta H: + Amt m = bi, 


Aye Ty F es ta Fee- Ama Xm = be, 
Min 2 F lanta F- -+ F Amn tm bn. 


The set of numbers (b,, ba, ---, bn) may be regarded as con- 
stituting a function b(k) = bk, in which the independent 
variable % takes on only a finite number of values, the 


integers from 1 to n. In the same way, each of the sets 
149 
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(dia, dm, +++, Qin) May be regarded as a function aj(k). The s's 
then are the coefficients in a linear combination of these 
functions, 

we = wlk) = x a (hk) + xe as(k) + <- -+ am dm (k), 


and are to be determined so that 


D (bie — we)? 
ck? 


shall be as small as possible. The sum of squares this time 
measures the discrepancy between the approximation and the 
function approximated. 

The form of the sum suggests a geometric interpretation, 
in which the functions ) (7%), w(K) are represented by points 
in n-dimensional space, with codrdinates (b, by, ---+, ba) and 
(Wi, We, +++, Wn) respectively, and X (b: — wy)? is the square 
of the distance between these points. By an extension of 
the same idea, the functions f(x), p(x) of the first paragraph 
are thought of as corresponding to points in a space of 
infinitely many dimensions, with f|[/(@)— ẹ()} da as the 
square of the distance between them. This definition of 
distance is the beginning of a systematic geometry of func- 
tion space. 

Another hint of geometric analogy which is present from 
the beginning consists in the recurring use of the term orthog- 
onal, two functions being called orthogonal to each other 
over an interval when the integral of their product over the 
interval is zero, While it is worthy of some emphasis that 
orthogonality of the functions which form the basis of the 
approximation is a convenience rather than a necessity, the 
notion of orthogonality is inseparably associated with the 
least-square condition in another way. 

Let the functions (<), pı (£), ---, Pm (£) be supposed con- 
tinuous, for simplicity of illustration. (It would be sufficient 
that they be integrable together with their squares, provided 
it is assumed that the p’s are properly independent, in the 
sense that every linear combination of them containing a non- 
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vanishing coefficient is different from zero over a set of positive 
measure; this condition is of course satisfied automatically if 
they are linearly independent and continuous.) Zn order that 
Cr, -*+, Cm be the coefficients giving the least-square approxima- 
tion, it is necessary and sufficient that f(a) — y (x) be orthog- 
onal to each of the functions pi(v), or in other words ¿t is 
necessary and sufficient that f(x)— p(x) be orthogonal to 
every linear combination of pi (£), +++; Pm (av). This can be 
verified algebraically, without a detailed examination of the 
necessary and sufficient conditions for a minimum in the 
calculus. 

Let W(x) be an arbitrary linear cambination of the p’s 
with at least one non-vanishing coefficient. Then 


w(x) ola) + h y (æ) 


is a linear combination of the p’s, for any value of /, and 
every linear combination different from g can be written in 
this form by a suitable choice of W. Let 


b 
1= | Ferdz, 
è) èb 
P; =f [ f(x) ol de =- f MEAE Eg (x)] fa h WDF dx. 


ah 
k= Peg) widz, 8S =f [Ya de> 0. 


Then J = 1—h(2R—hAS). If R4$0,% can be chosen so 
as to make J< J (for example by taking 4 = R/S), contrary 
to the supposition that is the minimizing function, while if 
AE S E I+èS>1 for every h + 0. 

The conditions [[fia)—9 (x)] pila) dæ = 0, i= 1, 2, ---, M, 
are equivalent to the m linear equations 


me 


ap 
p> Cj f: Pi (æ) Pi (x) dx =Í; S (a) pi (2) dx 


for determining the m c’s. It has been shown that these 
equations must be satisfied if the least-square problem is to 
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be solved, and that the problem will be solved if the equations 
are satisfied. For the moment there is still a question whether 
the equations have a solution. The answer is immediate, 
however; the left-hand members are independent of the func- 
tion f(x), the least-square problem obviously has the unique 
solution ¢ = ta = +++ = €m = 0 in the particular case 
J (x) — 0, and consequently the determinant of the coefficients 
must be different from zero. If the p’s were not linearly 
independent, the problem of approximation could still be solved 
in terms of a linearly independent subset of them, and this would 
be at the same time a solution in terms of the original set 
of p’s, but the solution in terms of the latter set as a whole 
would not be unique. 

Similar reasoning is applicable to the problem of the second 
paragraph of the chapter. Integrals are to be replaced by 
sums throughout, and in particular the property of orthogonality 
of two sets of numbers is expressed by the vanishing of the 
sum of the products of corresponding numbers of the two 
sets. The hypothesis of linear independence of the p's corre- 
sponds to the condition that the matrix of the coefficients aj, 
be of rank m. The condition of orthogonality characterizing 
the least-square solution takes the form that the set of 
residuals bj.— wx is orthogonal to each of the m sets of numbers tir: 


>` (br — wk) tir = 0. 
k= 


The 7th equation of this set may be constructed by multi- 
plying each of the given equations by the corresponding co- 
efficient of x; and adding the equations thus obtained. Written 
out at length, the new equations have the form 


n n n F 
~% 
(> lik aux) + (2 ik an) zat. + | 2 (ik m) ten 
\kK=1 ‘=1 =1 


h 
= > an bp, i= 1,2,- m. 


k=1 


They are the well-known “normal equations” for the solution 
of the problem of least-square adjustment. 
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In certain cases the necessary condition of orthogonality 
can readily be translated into another familiar form. Consider 
once more the case of functions of a continuous variable v. 
Let f(z) and e(x) be given functions for a < s < b, for 
` simplicity continuous, and let ọ(x) be non-negative and not 
identically zero over the interval, and let Pa(x) be the poly- 
nomial of the nth degree which minimizes the integral 


h 
if ola) L6) — Pr] dx. 

Then the remainder f (x) — Pa (x), if not identically zero where- 
ever o (7) +0, must change sign at least n+-1 times in the inter- 
val (a, b). Tf [oe (]"? is denoted by q(x), the integral to be 
minimized is the same as Sla@s@)—a(a) P,,(x)]*? dx, and 
the problem is that of approximating q(x) f(x) by a linear 
combination of the functions g (a), xq (£), -+ -, z” q (£). By 
the general proposition obtained above, it is necessary that 
q (x) f (xv) — q(x) Pa (x) be orthogonal to every linear com- 
bination of the functions z*q (x), or, in other words, orthog- 
onal to q (x) Qn (x). if Qu (x) is an arbitrary polynomial of 
the nth degree; in symbols, since [qg D]? = o (2), 


wi 
fi o (x) [F E) — Pala) Qr) dæ = 0. 


If f(x) — Pa (x) had not more than n changes of sign, it 
would be possible to construct a Qha(x) having the same 
sign as f (x)— P, (x) at all points where f (x) — P, (x) + 0, 
and this would make the integral positive, unless o (x) = 0 
wherever f(z) — Pa (<) 0. The condition that P, (x) be 
such as to give the remainder the requisite number of changes 
of sign is of course not sufficient for the solution of the 
least-square problem, since it is satisfied by any polynomial 
differing sufficiently little from the minimizing polynomial. 
A corresponding formulation is possible in the case of ap- 
proximation by trigonometric sums, with or without a weight 
function. 

The least-square problem, in general as well as in the 
special cases discussed in the preceding paragraph, is that 
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of choosing from a given linear family of functions the partic- 
ular one which is closest to another given function, according 
to the measure of discrepancy specified. The geometric inter- 
pretation that has been suggested gains in clearness if each 
function, instead of being represented merely by a point in 
a space of an appropriate number of dimensions, is also 
represented alternatively by the vector from the origin to 
the point in question. The linear family of funetions then 
corresponds to a linear spread in the geometric picture, and 
the function of closest approximation corresponds to the point 
of this spread whose distance from the point representing 
the function to be approximated is as small as possible; the 
residual function f(x) — » (a) (or b(k)— w (¥)) can be regarded 
as the vector from ọ (a) to f(x) (or from w (X) to b (%)); and 
the property of “orthogonality” characterizing it is associated 
with the fact that the shortest distance from a plane spread 
to a point outside it is perpendicular to the spread. The geo- 
metric terminology thus acquires additional plausibility. 


2. The general notion of angle; 
geometric interpretation of coefficients of correlation 


An obvious further step is to proceed from the notion of 
orthogonality to a general notion of angle in funetion space. 
As will be seen presently, the definition of angle is already 
implicit in that of distance, if the frame of Euclidean geo- 
metry is to be fitted consistently to functional relations, 
though the question as to the possibility of carrying the idea 
through systematically and without danger of internal contra- 
diction still calls for some elucidation. 

It will be well first to adopt a common notation for dealing 
simultaneously with functions of a continuous variable and 
functions of a discrete subscript. If x(t), y(® are continuous 
functions of t for a< t< b, let 


(a-y) = f z0 di. 


If £i, Lo, +++, Eni Yi, Yas +++; Yn are two functions of an index / 
which ranges from 1 to n, let 
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(z. y) = = Ek Yi» 


The quantity (x-y) is the inner or scalar product of the two 
functional vectors. In particular, (x.x) = Xak or | ædt, 
ete. The condition for orthogonality of x and y is that 
(x+y) =0. A third case worthy of explicit mention is that 
of infinite sequences tı, 72, -+-3 Yi, Yes +- Such that Sa? 
and Sy; are convergent. The interpretation then is that 


feas] 
(x.y) =% Pk Yk. 


All three are of course isolated particular cases from the 
point of view of general analysis. 

Let x(4), y(t). or x(k), y(k), be two functions represented 
by the points P, Q respectively, and let O be the origin, 
corresponding to a function which vanishes identically. By 
the convention already adopted, the sides of the triangle OPQ 
are 

OP*= (z-x), OW = (y-y), 
PQ? = ((y — x): (y—2)) = Y-y) — 2(@-y) + was). 


If the angle POQ is denoted by 6, application of the law 
of cosines to the triangle gives 


PQ = OP? + 0? — 20P- OQ cos 4, 
whence 20P-0Q cos # == 2(x-y), and 


(x-y) 
oso = ma 
cos 0 (æ x)! (y y)? 
When the independent variable is k = 1, 2,---.m, the 
formula becomes 
> tk Yk 
CoS O = -an aAA OI 
(De) (Dy 


This is recognized immediately as a fundamental formula in 
the theory of statistics. Jt is the coefficient of correlation 


www.rcin.org. pl 


156 THE THEORY OF APPROXIMATION 


between the variables x and y, if these have been reduced 
to the form of deviations from their respective arithmetic 
means, or, in other words, if X zk = Sy, = 0. Subsequent 
pages will show that the geometry of function space throws 
much light on the structure of more complicated formulas of 
correlation. 

It becomes important to inquire more closely as to the 
logical basis for the identification of analysis and geometry. 
The essentials of such a basis are implicit in the considerations 
leading up to the proof of Theorem III in Chapter IJ. To 
repeat in the present notation what is needed for the purpose 
in hand, restricting attention at first to a two-dimensional 
spread, let x, y be any two functions which are linearly in- 
dependent over one of the ranges specified in the third para- 
graph preceding (or over some other appropriate range); if 
the range is an interval, let it be supposed for convenience 
that the functions are continuous. Let 
(& y) 

(E 8) 


i, Fe m = y— Äi 


It follows immediately from this definition that (§,-4,) = 0. 
By the hypothesis of linear independence, furthermore, it is 
certain that 4, is not identically zero. If 


§ = §,/(§- 8), n= mhm m)’, 


the functions §, 7 are orthogonal to each other: (§-7) = 0; 
and they also satisfy the condition that (§-§) = (4-4) = 1. 
They are linear combinations of v and y, and it is seen at 
once that the determinant of the coefficients is different from 
zero, so that x and y conversely can be expressed as linear 
combinations of & and y. 

The functions x and y being given, let § and y now be 
any pair of functions such that x and y are linearly expressible 
in terms of ë and 7, and such that 


E.g). = 0, (E-E) = (9-9) = 1. 
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It has been shown that such functions ë, 7 can be constructed. 
There are infinitely many pairs satisfying the requirements; 
the original §,7 can be replaced by @,&-+ 8,7, s E+ Bah, 
if «1, i, @, By ave any constants such that 


«,«@,+28,8, = 0, a B= e+ = 1, 


Furthermore, the requirements can be satisfied even if s and y 
are not linearly independent; it suffices then to express x, y 
in terms of two linearly independent functions 2;, yı, and to 
construct functions §, as above with «, and y, in place 
of x and y. 

In terms of a chosen pair of normalized orthogonal 
functions §, 7, let 


a= aS tht, Y = üa + ban. 


(a x“) = a aa I, (y y) = a; +. be, 
(y—x)-(y—a)) = (a,— u) + b, — b). 


If the functions x, y are thought of as corresponding to the 
points P, Q, with the coördinates (a, bı) and (as, ba) respectively 
in a rectangular coördinate system, while O is the origin, the 
quantities (x-x), (y-y), and ((y — x) - (y —x)) are the squares 
of the distances OP, OQ, and PQ. The cosine of the 
angle POQ is 


Then 


t ty + bi by LO 
FEH E y * 


More generally, if 
u=Aatmy=—AS+Biy, v= lxt uy = A E+ Ban 


are any linear combinations of x and y, and if R, S are 
the corresponding points (Ai, By), (As, Bs), then (ru), (v-v), 
and ((v— u). (v — u)) are the squares of the distances OR, OS, 
and RS, and the quantity (u-v) is the scalar product of the 
vectors OR, OS. Every linear combination 42-++my corre- 
sponds to a definite point with coördinates (å a, +u ts, Abiu be); 
if z and y are linearly independent, a one-to-one correspondence 
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is established between functions linearly dependent on them 
and the points of a plane, in such a way that there is an 
actual identity between the measures of distance and angle 
and the quantities which were associated with them originally 
by analogy. So far, moreover, there is no need of any space 
of more than two dimensions. 

The correspondence being once established, the analytical 
relations expressing geometric facts are implied with logical 
conclusiveness by the geometric facts themselves. Consider 
for example the problem of determining a constant 2 to mini- 
mize ((y—Ax)-(y—4x)), when x and y are given. If s 
and y are interpreted as statistical variables, representing 
deviations from the respective arithmetic means, 2 is the 
coefficient of regression of y on x. The points corresponding 
to x and y being P and Q, as before, the function 2. is 
represented by a point M on the line OP, and y — 2 corre- 
sponds to the vector MQ, in the definite sense that when 
y—A4«x is expressed as a linear combination of § and y. 
the coefficients of & and y are the components of the vector. 
It is clear from the geometric figure not only that WQ must 
be perpendicular to OP, or in other words that y — 42 must 
be orthogonal to x, but also that the value of 2 which 
accomplishes the purpose is given by 

2-OP = OM = OQ cos 6, 

— cos 9 = wy” ‘ @ iy) my = SHS 
(x-1) (æ. x)? (yy)? (x.x) 
where @ is the angle POQ, and furthermore that the minimum 

value of ((y — 4x)-(y—A2)) is 

MQ? = (OQsin 6)? = (y - y) (1— cos? 0) = (y - y) 0— r°), 
if cos 6, interpreted as a coefficient of correlation, is denoted 
by r. 

For the geometry of three (linearly independent) functions 
z, Y, 2, let Ss, 41, S, y be defined as before, and let 
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Then ¥, 4, Ẹ satisfy the conditions 


&-)=€-D=%-9=0, €-)=@-)=C-09=1, 


Ni 


and a, y, 2 are linearly expressible in terms of them. There 
are infinitely many sets of functions satisfying these same 
conditions, and serving equally well for the representation 
of x,y, z, any such set being expressible in terms of the 
particular set §, 7, ¢ by means of the coefficients of an orthog- 
onal linear transformation in three variables. Any linear 
combination u = 2w -+ py +r: can be expressed in the form 
AE+ B+ CE, and can thus be put in correspondence with 
a point (A, B, C), the square of whose distance from the 
origin is the quantity (u-u) = 4A*+ B>4 C°. If u and v 
are two such linear combinations, corresponding to the points P 
and Q. the cosine of the angle POQ is (u » v)/[(u < u) @ - P”. 

The representation of functions by points or vectors is 
particularly convenient for the visualization of coefficients 
of partial and double correlation. Let «, 4,2 be three given 
functions, corresponding to points P, Q, R in three-dimensional 
space. If a, y, z are sets of deviations from arithmetic means, 
so that the statistical terminology is appropriate, the co- 
efficient of partial correlation between x and y, when z is 
held fast, is the coefficient of correlation between a — Zz and 
y— uz, Where å and x are the regression coefficients of s 
on z and of y on z respectively. The function #— Zz is 
a linear combination of s and z, orthogonal to 2; its geo- 
metric counterpart is a vector in the plane POR, perpendicular 
to OR. Similarly, y— mz is represented by a vector per- 
pendicular to OR, and lying in the plane QOR. The angle 
between these vectors measures the dihedral angle formed 
by the two planes. So the coefficient of partial correlation 
in question is the cosine of the dihedral angle. Let P,, Qi. 
R, be the points in which the rays OP, OQ, OR pierce the 
unit sphere about the origin as center. In the spherical triangle 
P, Q Rı, let «, 8,7 be the measures of the angles P,, Qi. 
R, respectively, and let a,b, e be the sides opposite these 
angles. Let rs, ris, ss respectively be the ordinary cor- 
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relation coefficients of x and y. x and z, and y and 2; let 
rı2.3 be the partial correlation coefficient which has just been 
discussed, and let ris. and reg. be the other coefficients of 
partial correlation. It has been seen that 72.3 = cosy. In 
the same way, 713.2 = COS Â, r23. = cos œ, while 7i = cosc, 
ris = cos b, ras = cosa. By the law of cosines, 


cos € — cos 4 COS Ù 
sin a sin b 


cos y = 


which means that 
= = Vis — is 23 
E S > 
[t—) G—ih or 

This is the standard formula expressing a coefficient of partial 
correlation in terms of ordinary correlation coefficients. The co- 
efficients 713.2, 723.1 of course have corresponding expressions. 
The inverse formulas 


T12.3 -+ 113.2 T23.1 


= M #,,) d— re 


ete., are similarly obtained from the polar triangle. 

For the definition of one of the coefficients of double cor- 
relation, parameters 2, œ are to be determined so as to mini- 
mize the quantity ((@ — ły — mz). (x — ły — uz)), or iù 
other words to give the least-square approximation of a by 
means of a linear combination of y and 2. This requires that 
x— Ìy— uz be orthogonal to both y and 2; geometrically, 
the point M representing the combination 2y + æz is the foot 
of the perpendicular from P on the plane QOR. Then 7.23, 
the coefficient of double correlation between a and the pair 
of variables y and 2, is the coefficient of simple correlation 
between x and 4y-+- uz, the cosine of the angle MOP between 
OP and the plane QOR. Let this angle be denoted by 4. It 
is measured by the are P, S,, if S, is the foot of the altitude 
from P, to the side Q, R, in the spherical triangle. Hence / 
may be calculated as a side of the right triangle R, S, P,, 


by the formula 
sin ) = sin b siny. 
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By substitution from the relations 


sin? b = 1— 7%, 
siny = 1—72,, = reno ie BREST AS BE 
12.3 (1 aay ri) (1 = ra 


it is found that 


ja 


(i + Tis — 2ta TiTa)” 
~ aSa 
There are corresponding formulas for the other double cor- 

relation cofficients 72.3 and rg... 

The same figure may be used to obtain formulas for the’ 
coefficients 4 and mw, the partial regression coefficients of 
x on y and z. Let lines be drawn through M parallel to 
OR and OQ, meeting OQ and OR at K and L respectively, 
to form a parallelogram OK ML. The vectors OK and OL, 
constituting a resolution of OM into components collinear 
with OQ and OR, represent separately the terms 4y and we. 
On the surface of the sphere, let the ares R, S, and Sı Q, be 
denoted by d and e, so that d measures the angle LOM, 
while e measures the angle MOK and its equal OWL. (The 
formulation is adapted throughout to the case in which the 
point M falls within the angle QOR, so that S, is interior 
to the are Q, Ri.) In the plane triangle LOM, having two 
of its angles equal to d and e respectively, the third angle, 
at L, is the supplement of d-+e. But the ares d and e on 
the sphere make up the side «æ of the original spherical triangle. 
So the law of sines in the plane triangle gives 


ri. = cosh = (1— sin? h)” = 


OL sin e sine 


OM  sin(d+e) sin a‘ 


In the right spherical triangle P, Sı Qı, on the other hand, 


3 sinc cos 8 
sine = —————_,, 
cos h 
so that 
OL = OM sinc cos 8 


sin a cosh 
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while OL = w-OR, and OM = OP cosh. Hence 


OL OP sine 
“= OR ~ ORsna °°" 
or with the use of a conventional notation, presently to be 
explained at greater length, for regression coefficients and 
standard errors of estimate, as well as that already used for 
coefficients of correlation, 


01.2 
bis. = 114.2 
63.5 
Similarly, pias 
sin 1.3 
b.s = A = COS YX = — Ti. 
12.3 OQsina dz 12.3 


The results of the preceding paragraph can be obtained by 
a possibly less straightforward but more exclusively geometrical 
method, making no use of the spherical triangle or of spherical 
trigonometry as such. Let the letters O, P, Q, R, M, K, L 
have the same signification as before. The letters a. b, ¢, 
«, B, y retain their previous meanings as measures of the 
face angles and dihedral angles of the trihedral angle O-PQ R. 
Let H be the foot of the perpendicular from M on OQ, and 
J the foot of the perpendicular from R on OQ. The triangles 
KMH and ORJ in the plane QOR are similar, since K M 
is parallel to O R (having been so constructed), M.H is parallel 
to RJ (both being perpendicular to OQ), and KH and O./ 
are collinear. Furthermore, K M and OL are opposite sides 
of a parallelogram. So. 


OL KM _ MH 
DE Da “Ber? 


But the length of RJ, the perpendicular from Æ on OQ, is 
ORsina. Also, PH is perpendicular to OQ, since MP, being 
perpendicular to the plane QOR, is perpendicular to the 
line OQ, and OQ, being perpendicular to MP and MH, is 
perpendicular to their plane and to the line PH in that plane; 
hence MHP is the measure of the dihedral angle 8, so that 
MH = HP cos8 (as PMH is a right angle), while HP 
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= OPsinc, making MH = OPsinccos8. The resulting 
expression for w is the same as before. 

The preceding calculations are not restricted in substance 
to the case of statistical variables, but can be formulated 
without the terminology of correlation, and are then applicable 
to any functions z, y, z coming under the original hypotheses. 
It is to be emphasized also that for the time being no use 
has been made of space of more than three dimensions: the 
geometry is the actual geometry of experience. 


3. Coefficients of correlation 
in an arbitrary number of variables 


For dealing with relations of higher complexity it will be 
convenient to modify the notation somewhat. When there 
are m functions to be considered, let them be denoted by 
Li, Loz +, 2m. The case of primary interest will be that of 
statistical variables, measured from an arithmetic mean in each 
case; i. e., xi will stand for a set of numbers za, ve, «++. Lins 
subject to the condition that sa -+ a+ --- + £in = 0. Apart 
from technical notation and terminology, however, the work 
will be valid for functions of any of the types previously 
considered. As an additional item of notation, (@-) will 
be abbreviated to ((z))*. 

The reduction of general coefficients of correlation and 
coefficients of regression to expressions in terms of coefficients 
of lower order depends on the following fundamental prop- 
osition: 

Let 4s, +++, Am be determined so as to minimize 


((a — Ag %— «+> — Am Xm))*, 
and with these values of the 2's, let 


w Tı — Ag ig — +++ — Am Tm. 


Let fig, +++, fm and vg, +++. Vm be determined so as to minimize 


(a, — fs Z — ++» — him Xm)? and (xz — vs £s — ++» — Vm Xm), 
and let 
P = Zr Pils ~> tm mem; Y= Le— Vg Xg— +++ — Vm Tm 


11* 
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Let A be determined so as to minimize ((g — A W))?. Then 
o) py— Ayp, 


The proof depends on the still more fundamental fact, imme- 
diately deducible from the definition of orthogonality, that 
if one function is orthogonal to each of two other functions, 
it is orthogonal to every linear combination of them. 

It is at once apparent from the detinitions of y and y that 
y — Awy is a linear combination of the z's, of the same form 
as œ, the coefficient of x, being unity in each case; the 
question at issue is that of the identity of the remaining 
coefficients. 

By a theorem discussed early in the chapter, a (necessary 
and) sufficient condition characterizing the coefficients in œ 
is that « be orthogonal to each of the functions a, ---, £m. 

By the same theorem, œ is orthogonal to each of the 
functions £s, +-+, £m, and WY is likewise orthogonal to each 
of these functions. Since as is orthogonal to g and to w, 
it is orthogonal to the combination y—<4wyW. The same is 
true of ay,--+,7». In other words, y— Aw is orthogonal 
to each of the functions x3, +++, Em. 

By one more application of the theorem, y—‘y is 
orthogonal to w. But a, can be expressed in the form 
Ty — WH gay+--+mam. Consequently, being orthogonal 
to W, a, +++; Em, 9 —AW is orthogonal also to x2. 

The identity of ¢—/w with w is thus established. 

For m = 3, the proposition is equivalent to a relation of 
perpendiculars which is important in deriving the formulas 
of spherical trigonometry: if OP, OQ, OR are three rays 
issuing from O, if N is the foot of the perpendicular from P 
on the line OR, if a line is drawn through N in the plane QO R 
perpendicular to OR, and if M is the foot of the perpendicular 
from P on this line, then M P is perpendicular to the plane QOR. 
This figure yields the equation sink = sinb siny, which was 
used in obtaining the formula for a coefficient of double 
correlation. Essentially the same configuration appeared also 
in the discussion of partial regression coefficients. 
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For the statistical application with an arbitrary value of m 
a somewhat elaborate notation is appropriate. Let the func- 
tion w, as originally defined by the formula 7;—A.a2—.-.—Amam 
according to the least-square criterion, be denoted by 21.23..... 
It may be spoken of as the residual of x, with respect to 
va, Xg +*+, tm. (The order of the subscripts 2, 3, +--+, m 
among themselyes is clearly immaterial.) Similarly, ọ and Wy, 
the residuals of a, and of sə with respect to a, +++, £m, 
are to be denoted by 21.3...n and 29.y...m. The standard 
deviation og of any one of the original variables 2; is defined 
by the equation og = ((a,))*/n. (The presence of the 
denominator n is a mere matter of definition, as far as the 
present discussion is concerned, since the equations will in- 
volve only ratios of standard deviations; except for the 
statistical interpretation, the (positive) square root of ((z))* 
itself may be used in place of cx.) The correspondingly 
defined standard deviation of such a residual as 2y,04...m, the 
standard error of estimate of xı in terms of w2,+++. Zm, İS 
denoted by  64,2%.,.,. The partial regression coefficients 
hg, 4s, +++, Am are represented by Dio.st..-m, Dis.24-..m3 ++ 
Dim.ox-..m—1. The first subscript of each of the b’s is that 
of the variable approximated, the second is that of the 
particular variable to which the coefficient in question is 
attached in the regression formula, and the other subscripts, 
the order of which among themselves is without significance, 
are those of the remaining variables. In the same way, 
fly, +++; m are to be replaced by Diy.4...m.+++s Dim.s--«m—1; 
and Va, °°+, Vm by Dog... <ms tee Oam. Simna The coefficient 
of correlation between ¢ and W, or, in the present notation, 
between @1.s...m aNd 2.3..., is the coefficient of partial 
correlation between xı and xı when Ts, ---, %m are held fast, 
and is represented by 7'12,3...m- 

In the new notation, the general proposition about the 
identity of the functions previously called œ and gy—‘/yW 
asserts that 21.93.... is the same as the residual of 21.9...m 
with respect to 22.3.... It is important to bear in mind 
however that its essential content is independent of the 
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number of variables and their individual designations, and 
this essential content is more adequately though less concisely 
expressed by saying: 

The residual of a given function with respect to a set of func- 
tions may be obtained by calculating successively the residuals 
of the given function and of any chosen function of the set 
with respect to the remaining functions of the set, and then 
taking the residual of the former of these residuals with respect 
to the latter. 

An ealier paragraph contained a derivation of the value 
of the simple regression coefficient minimizing the expression 
((y—42)-(y—Ax)). The formula of that paragraph which 
reads 

x ey) E 
(xa) (x-a)? (y. y)” 
becomes in the present notation be; = (62/01) 7,2. Interchange 
of the variables gives bie = (01/02) r12; the simple coefficient 
of correlation is symmetrical in its two variables, so that 
ə = Viv, While by; and dj, are different. Incidentally it 
appears: that ri = (bis bai). 

Corresponding formulas for partial regression coefficients 
can be obtained immediately. It is apparent on inspection 
that while the coefficients of £s, ---, 2m in the function 
71.23...m = w, as expressed in the form y— AW, are com- 
binations of the x's and the v's, the coefficient A, = biz. ..m is 
merely A. But this -7 is the simple coefficient of regression 
of » with respect to Ww, and as such is expressible in terms 
of the standard deviations of » and w and the correlation 
between them. The standard deviations of ọ and w are 
G1.3...m and G.3...m respectively, and their coefficient of cor- 
relation is ry2.3...n. Consequently 


1 — Asom, 
12,.3---m = 1'19.2--.m. 
92.3-.-m 


By interchange of subscripts—in other words, by consideration 
of the coefficient of regression of £2.s...m With respect to 
71,3...m—it is found that 
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62.3...m 
bot.3...m = eS 1'12.3.--m} 
the definition of the correlation coefficient is symmetrical 
with respect to the first two subscripts. Combination of the 


equations for the b’s gives 
r2.3-..m —= (dr2.8.-.m boia- m). 


In the earlier paragraph to which reference was made above 
the minimum of ((y—4Az)-(y—42)) was evaluated in the 
form (y-y)(1—vr*). In the new notation this result is ex- 
pressed by the equation o? , = o3 (1 —1$,). The corresponding 
equation with subscripts reversed is o? , = o7(1—rj,). Applied 
to the standard deviation of œ = y — A y, regarded as the 
residual of gy with respect to w, it becomes 


g? 


eS ee 
iem o O18 (1 Finean 


Here again it is important to recognize the essential content 
of the formula, as distinguished from the notation in which 
it is expressed. Written down successively for cases of in- 
creasing complexity, with a particular choice as to the dis- 
position of subscripts each time, it yields 


ot, = FU), 


TEES ear A eee 
2 og = Oaa = O (1 — Tis.) 
AEN ee Rahe 
F084 — %.493 = O28 og (1 — Fy os) 
2 as tee se Tea 
OF 9.6m = GF m2a..-m—1 OF smi 5 PO 


and by combination of all these equations 
GES m = oy (1 —rip) (1 —riz.2)(1 —rit.23) css (i FR rim.28.. ai) s 


Reverting to an abbreviated notation for certain residuals, 
but with a modification of that previously employed, let 


@, = 21,34---m; @ = 12.34.--my 


pı = D14...m; Q2 = P2.4.. m; Pa = ZA. -m- 
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By the fundamental theorem about residuals, œ, may be re- 
garded as the residual of ọ, with respect to ps, and œ is 
the residual of ys with respect to ys. Hence the coefficient 
of correlation between œ, and œ is the coefficient of partial 
correlation between p, and gs when ps is held fast, and is 
expressed in terms of the simple correlations between p1, pə 
and p, by the formula previously obtained in the discussion 
of correlations among three variables. If the correlation 
between œ, and œ is denoted for the moment simply by 7’, 
and if the correlations between p, and pə, pı and pz, and ws 
and p, are called respectively rie, rig and rs, then 


r r 
y ri2— ristas 


i e E EF g 


But from the point of view of the dependence of these 
quantities on 21, a, +++. Ems 


ee he ee eee a 
r = Tu m, 112 = T.A: my 118 = 118.4.--m, 128 = 128.4: em 


and consequently 
Y12.4.: -m — 113.4.. ani BE 


712.834. -m = ur —r?, eee (1i— OP 


By the last equation any partial correlation coefficient can 
be calculated in terms of correlation coefficients of lower 
order, that is to say, correlation coefficients involving a smaller 
number of variables, and so ultimately in terms of ordinary 
correlation coefficients. 

Let the partial regression coefficients of z, with respect 
tO £o, L3, +++. Sm be denoted once more by Že, As, +++, Am: 
and let © = hy xat y tyt ++» +dmn am. The coefficient 
of correlation between x, and @ is the coefficient of multiple 
correlation between xı and the set of variables x2, «++. Em. 
It may be represented by 71,23... If the regression coefficient 
of 2, on ® is L, the square of the standard deviation of 
rı — LO is o?(1—r;,,,,). But any constant multiple of © 
is a linear combination of z2, +--+, £m, and by the definition 
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of D no other linear combination of x2, ---, £m can give so 
good an approximation to x, according to the least-square 
criterion as @ itself. Hence it must be that L= 1, and the 
standard deviation of z,— L@® is the standard deviation of 
xı — M, already denoted by 4 .23...m, SO that 

OF 0g. cin = oF (1 ce ee 

Taken in conjunction with an equation previously obtained 
for a? oa. ,,, this shows that 


1 Ti os. “om = (1 rio) (1 ris:3) U Yi4.23) ia (1 "ie 23+ eae | 


a relation from which the coefficient of multiple correlation 
can be calculated. 


4. The geometry of frequency functions 


Apart from the generality which the preceding account of 
the application of geometry to analysis possesses by virtue 
of the fact that at the outset the independent variable may 
be one taking on a finite number, an enumerable infinity, or 
a continuous infinity of values, its substance can be given 
still another setting. In the statistical case, if there are for 
example just two functions concerned, zy = x(k) and yr = y (k), 
the formulas involve the different values of / symmetrically, 
and are not affected if the n pairs of numbers (fk, Yk) are 
permuted, each pair by itself being kept inviolate. That is 
to say, the independent variable serves only to define the 
association of a value of x with a value of y, and has no 
further significance of its own. For dealing with large 
numbers of observations, the problem may be idealized by 
supposing that (within limits, perhaps) any value of x may 
be associated with any value of y, but that some pairs of 
values (x, y) occur oftener than others, to an extent indicated 
by a frequency function (x, y), whose integral over any 
region of the x, y plane measures the frequency of pairs of 
observations (x, y) falling within that region. If x and y 
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are measured as deviations from their respective means, this 
fact is indicated by the conditions 


J xo(a,y)dxdy = 0, [J y glz, y) dx dy = 0, 


the integrals being extended over the range of definition of g. 
The squares of the standard deviations of x and y are then 


ff y (a. y) dx dy Sfr o (x, y) dx dy 
foc, y) dx dy f g(x, y) dr dy 


and the coefficient of correlation between them is 
Sfo y (x, y) dæ dy 


sar 1/2 ct a 
HE g(a, y) dx a| [Sfo p(x, y) dx ay| 


To illustrate the geometry of frequency functions, let the 
case of three dimensions be chosen. Let (a, y, 2) be a non- 
negative continuous function of its three arguments, to be 
regarded as a frequency function for the occurrence of the 
set of values (x,y,z) for three measured variables. To ob- 
viate the necessity of convergence proofs, let it be supposed 
that g is different from zero only over a finite domain of 
three-dimensional space. It may further be supposed without 
essential loss of generality that the triple integral of g over 
the domain where it does not vanish is equal to unity—in 


abbreviated symbolism, fo = 1—which means that all 
frequencies are referred to the total number of cases as 
unit. The assumption that x,y,z are measured from their 
arithmetic means is equivalent to the set of equations 
fog = Sus = fey = 0; no use will be made of these 
equations, except to justify the statistical terminology em- 
ployed. The squares of the standard deviations of x, y, 2 


are fe P, Svs, and fe, since the quantity fo, which 
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would otherwise appear as denominator, is 1. The coefficient 


of correlation between x and y is fay y| [lf y) (Suro. 
and the other coefficients of correlation are correspondingly 
defined. 

As a first step toward the setting up of a geometrical 
representation, let auxiliary notation be introduced as follows: 


è 1/2 . 
xX = 4, lJ x*y| Sis j Xyp = A, 
G EY 1/2 è 
Y = y—(A/o?)X, | rž] =, J x9 aye EZ =Ü, 
1/2 
Z = z — (Ble X — (ChY, |f s] = üj. 


In these formulas Y, FY, and Z, as well as pọ, are to be 
thought of as functions of x, y, and z, and the integration 
as performed with regard to these variables. As the func- 
tional determinant of Y, Y, Z with respect to x, y, z is 1, 
however, the integrals may equally well be taken with regard 
to X, Y, Z, if the limits of integration are suitably adjusted, 
or, what comes to the same thing (since each integrand is 
identically zero except over a finite region), if the integrals 
are extended over the whole of space. As a function of 
X, Y, Z, let v(x, y, 2) be denoted by gı (X, Y, Z). Then, 
with Y, Y, Z as variables of integration, 


[xrs, : f xz, & | Zy, L'y 
Let 


we 


S, a X/o,, 3 e Y/t,, či — Zion, 
and let the quantity c, t, 6, 9 (a, Y, 2) = a Tı ©% Pı (X, Y, Z), as 


a function of the new variables, be represented by ®,(§,, %1, 1). 
In terms of §,, qı, ¢; as variables of integration, the relations 


fano Í £00, = | mi O — 0 
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are satisfied, together with the equations 
[so =f iio, =JoO = 1, 
while SSS ecu. z)dadydz = 1 goes over into 


ot fo, (&, M15 1) dé, dri at, = 1. 


If aS tbigitad, and ag&+bey,+c.6, are any two 
linear combinations of §,, m, 6, 


fea S bd mt eS daS.+ bent oti), = amt hbt eve. 


Finally, for the sake of generality, let §,7,¢ be any 
variables expressible in terms of §,, 7%, ë, by means of a 
(normalized) orthogonal transformation. Let ®, (§,, m, ¢,) 
= P(E, 4, č). As the determinant of the transformation is 
+1, the equations of the preceding paragraph, including the 
last one as formulated with the various sets of coefficients 
involved in the transformation, give 


fro = fico = fro = o, 
fo - fo = fro = feo = fi 


the variables of integration now being £.7,¢. The basis 
of the geometrical interpretation is the possibility of finding 
ë, 4, Ç as linearly independent linear combinations of x, y, 2, 
so that these relations are satisfied. 

The equations expressing ¥, 7, ¢ in terms of x, y, 2 mani- 
festly are in fact linearly independent, and x. y, 2 consequently 
can be linearly expressed in terms of £, 4, ¢. Furthermore, 
any linear combination of x, y, 2 can be similarly expressed. 
Let 

U = wx hyt = 4,§+B74+G¢ 


be any such combination. Then 


[fos dadydz = [ff + By +C,0)? Odsdyds 


= 4i+ Bi+Ci. 
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If 
V = mgt hy +r = A8 +B + Ct 


is any second combination of the same form, 


f f [U Vy dædydz = A, 4s + Bi B+ OCs. 


Let P be the point with coirdinates (Aı, Bı, C1), and Q the 
point (As, Bs, Cs), with reference to a system of rectangular 
axes in three dimensions, and let O be the origin. Then 
fe T?g and Svey, the variables of integration being x, y, 2, are 
the squares of the distances OP and OQ, and 


ferfe S 


the coefficient of correlation between U und V, is the cosine 
of the angle POQ. Through the medium of the equations 
expressing x, y, 2 in terms of S, 4, C, each of the variables 
x, Y, 2, and every linear combination of them, can be associated 
with a definite point in three-dimensional space, in such a way 
that standard deviation and coefficient of correlation have the 
same sort of geometrical meaning as before. From this begin- 
ning the geometrical structure of the theory of correlation 
can be built up along the lines previously followed. 


5. Vector analysis in function space 
There is further scope for the application of simple geometric 
ideas in functional analysis, where the complete picture calls 
for a geometry of infinitely many dimensions. For a single 
illustration (discussed by Lévy, Leçons d'analyse fonctionnelle, 
Paris, 1922, pp. 127-128), consider the integral 


D 
EEEL =f F(x, y, y') da, 


where y is a (suitably) arbitrary function of z, y = f(e), 
and F is a given function of its arguments. Not to enter 
into details with regard to questions of continuity, let it be 
assumed that all the functions that appear in the discussion 
have as many continuous derivatives as are needed to justify 
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the operations performed. If y is looked upon as a point in 
a function space of infinitely many dimensions, or as a vector 
from the origin to the point, 2(y) is a scalar point function 
in that space. Subject to the appropriate conditions of diffe- 
rentiability, let 4 (x) be an arbitrary function vanishing at 
a and at b, and / an arbitrary constant. The familiar process 
of differentiation gives 


d ss t ' La Al , 
an Ot’ »| nae I$ [7 Fy (x, y, y) +w Fy (a, y, y')) de 


b 
E f 4 [ze yy) Sn =. Fy (z, Ys n| dy. 


the last expression resulting from an integration by parts 
performed on the second term in the brackets, with use 
of the fact that y(a) = (b) = 0. Let the function 
Fy—(d/dx)Fy be denoted by ọ (x). Then the expression 
for the derivative may be abbreviated to | yọdæ, or, in 
a notation previously employed, (4-9). The variation hy may 
be regarded as an infinitesimal vector increment of the vector y 
in function space, of geometric magnitude h(y-7)"*. If the 
increment of 2 is divided by this quantity, instead of +, 
passage to the limit gives a directional derivative in the 
direction of the vector y. Its value is (7-@)/(y-7)"?. But 
this can be written in the form 


(7 - 9) 

(7-4)? (p- gy?’ 

from which it appears that the directional derivative in the 
direction y is equal to the quantity (p - p)? multiplied by the 
cosine of the angle between the vectors y and p, being greatest 
when 4 is collinear with p. The “functional derivative” p, 
considered as a vector in function space, thus has the character 
of a gradient of the scalar point function 2. 

The functional derivative can also be obtained formally 
after the analogy of the ordinary representation of a gradient 
in terms of a rectangular coérdinate system. Let it be 
supposed now that y = f(x) itself vanishes at the ends of 


(p - p)? 
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the interval (a, b), and that it is expanded in a series of 
normalized orthogonal functions ux (a), each of which likewise 
vanishes at the ends of the interval, in the form 


Fla) = ay uy (2) + ag us (x) + «+s. 


Let it be assumed further that this series admits differen- 
tiation term by term. The integral 2 is a function of the 
infinitely many variables (a, ds, ---), the codrdinates of the 
point y with respect to a set of rectangular axes in space 
of an enumerable infinity of dimensions. The derivative of 2 
with regard to ar is 
gA 
ý. = Fle; Y: y’) ae 


4 


b 

z if [Fy @, y, y’) we la) + Fy (Œ, y, y') uk (x) dæ 
‘ l 

= $ [Fy (a, Y, y’) — a Fy (£, Y, y')] un (x) dar 


b 
= f p(x) Ux (a) dx. 


This is the Fourier coefficient of g(x) with respect to urls). 
The expression 


x 
vv 62 

Utk (T 
PA 0 ük x (x) 


for a vector having the component 62/6a;, in the direction 
of the corresponding codrdinate axis is the formal expansion 
of the functional derivative p according to the orthogonal 
system (ux). 

An exposition of other elementary developments of the 
vector analysis of function space, rendered concrete by the 
use of theorems on the convergence and degree of convergence 
of certain expansions in series of orthogonal functions, has 
been given by the author elsewhere (Annals of Mathematics, (2), 
vol. 27 (1926), pp. 551-567; Bulletin of the American Mathe- 
matical Society, vol. 32 (1926), pp. 641-643). 
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